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CHAPTER  1 
INTRODUCTION 

Classical  feedback  theory  has  proven  to  be  successful  in  the 
design  of  sing-ie-input  single-output  feedback  systems.  Recent  years  have 
seen  a  renewed  effort  to  generalize  this  theory  to  multiple-input  multiple- 
output  systems.  The  purpose  of  this  thesis  is  to  attempt  such  generaliza¬ 
tions  for  some  selected  topics  from  classical  theory.  Following  are  a  brief 
summary  of  each  chapter  and  a  description  of  how  the  chapters  relate  to  each 
other . 

The  purpose  of  Chapter  2  is  to  review  the  concepts  from  classical 
theory  for  which  generalizations  will  be  pursued  in  later  chapters.  The 
importance  of  each  concept  to  the  process  of  classical  design  is  discussed, 
thus  motivating  work  on  extensions.  Three  concepts  in  particular  are 
stressed. 

First  is  the  fact  that  certain  closed  loop  transfer  functions 
directly  express  the  quality  of  a  feedback  design.  Thus  it  is  natural  to 
state  design  specif ications  in  terms  of  these  transfer  functions.  Despite 
this  fact,  classical  methods  have  focussed  on  the  manipulation  of  gain  and 
phase  of  the  open  loop  transfer  function.  That  this  approach  is  successful 
is  due,  at  least  in  part,  to  a  second  important  characteristic  of  classical 
methods:  that  the  relation  between  open  and  closed  loop  system  properties 
is  well-understood.  Thus,  the  quality  of  a  design  may  be  evaluated  from 
knowledge  of  the  open  loop  transfer  function;  similarly,  design  specifica¬ 
tions  may  be  stated  in  terms  of  this  function. 


A  third  important  idea  from  classical  theory  is  that  there  exist 


hard  limitations  on  the  ability  of  a  linear  time-invariant  system  to 
achieve  design  specifications.  The  limitations  of  particular  interest  in 
j  this  thesis  are  those  known  as  the  Bode  integral  relations  and  Bode  gain- 

phase  relations.  These  limitations  manifest  themselves  as  tradeoffs  which 
must  be  performed  among  desirable  system  properties  in  different  frequency 
|  ranges.  Mathematically,  the  tradeoffs  are  due  to  the  fact  that  transfer 

functions  must  be  analytic  in  the  complex  frequency  variable.  This 
requirement  imposes  a  great  deal  of  structure  upon  transfer  functions;  it 
I  is  this  structure  that  allows  one  to  prove  that  not  all  design  goals  are 

achievable. 

The  Bode  relations  and  their  interpretation  in  terms  of  design 
tradeoffs  provide  the  scientific  foundation  for  classical  feedback  theory. 
One  way  of  viewing  classical  design  is  as  a  systematic  process  for  exploring 
the  set  of  designs  possible  given  the  various  limitations  due  to  the  open 
loop  plant  and  the  conflicts  among  design  specifications.  This  process 
involves  a  certain  amount  of  trial  and  error;  however,  various  rules  of 
thumb  are  available  to  guide  the  designer.  These  are  based  upon  knowledge 
of  the  relation  between  open  and  closed  loop  system  properties  and  the 
unalterable  properties  of  systems  given  by  the  Bode  integral  equations. 
Examples  of  rules  of  thumb  are  "high  loop  gain  yields  good  disturbance 
rejection"  and  "loop  roll-off  near  crossover  should  be  no  greater  than 
20  db/decade."  Now,  it  is  granted  that  such  rules  of  thumb  will  generally 
be  insufficient  to  carry  out  a  complete  system  design.  It  appears  their 
true  significance  is  that  they  allow  the  designer  to  rapidly  get  in  the 


vicinity  of  a  satisfactory  design,  after  which  a  more  detailed  analysis  must 
be  made.  For  this  reason,  generalization  of  classical  rules  of  thumb  to 
multiple  loop  systems  appears  to  be  a  worthy  endeavor.  Since  the  classical 
rules  of  thumb  discussed  above  could  each  be  given  a  theoretical  justifica¬ 
tion,  a  first  step  will  be  to  develop  the  necessary  theory  for  multiple 
loop  systems. 

One  advantage  of  working  with  open  loop  quantities  is  that  various 
design  limitations  due  to  properties  of  the  given  plant  are  seen  directly. 
Whether  open  or  closed  loop  transfer  functions  are  considered,  however,  it 
is  important  to  express  the  design  limitations  in  terms  of  the  transfer 
functions  evaluated  along  the  imaginary  axis,  for  this  is  where  design 
specifications  are  imposed.  Certain  design  limitations,  namely  those  due 
to  open  right  half  plane  poles  and  zeros  of  the  plant,  have  commonly  been 
considered  in  terms  of  their  effect  upon  the  gain  and  phase  of  the  open 
loop  transfer  function  along  the  imaginary  axis.  Recent  years  have  seen 
renewed  interest  in  expressing  these  limitations  in  terms  of  closed  loop 
transfer  functions.  Existing  results,  however,  have  only  been  stated  in 
terms  of  these  functions  evaluated  at  isolated  points  in  the  open  right 
half  plane.  Thus,  in  Chapter  3,  equivalent  statements  of  these  constraints 
are  given  in  terms  of  closed  loop  properties  along  the  imaginary  axis.  These 
statements,  which  are  very  much  in  the  spirit  of  the  Bode  integral  relations, 
show  that  tradeoffs  among  system  properties  in  different  frequency  ranges 
must  be  performed. 

Although  insight  into  design  limitations  may  be  gained  from  the 
integral  relations,  they  also  show  that  direct  design  in  terms  of  closed 
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loop  transfer  functions  is  complicated  by  the  necessity  of  satisfying  a 

f*5 

fairly  complicated  set  of  constraints.  This  observation  suggests  that  a 
criterion  of  merit  for  evaluating  proposed  design  methods  is  the  amount  of 
insight  provided  into  such  constraints.  Moreover,  the  ability  to  effectively  p 
perform  the  associated  tradeoffs  among  design  goals  should  also  be  present  in 
any  design  technique.  The  purpose  of  Chapter  4  is  to  discuss  the  relative  v 

merits  of  design  methods  employing  open  loop  transfer  functions  versus  those 
employing  closed  loop  transfer  functions.  In  particular,  it  is  conjectured 
that  open  loop  based  techniques  might  have  certain  advantages  in  dealing  with 
limitations  imposed  by  properties  of  the  plant.  It  should  be  stressed,  however,  ~ 
that  the  purpose  of  the  discussion  is  not  to  reach  any  conclusive  statements 

*w  * 

regarding  the  usefulness  of  any  particular  method.  Rather,  it  is  felt  that  a 
detailed  examination  of  the  issues  such  conjectures  raise  may  be  of  importance  •** 
in  the  development  of  design  techniques.  In  the  course  of  the  discussion  a 
class  of  recently  proposed  design  methods,  the  H^-optimization  techniques, 
are  analyzed  and  found,  for  at  least  some  problem  statements,  to  be  ^ 

equivalent  in  many  ways  to  other  closed  loop  based  methods.  It  is  seen  that 
the  integral  constraints  developed  in  Chapter  3  are  implicitly  present  in 
these  design  methods.  “ 

To  summarize.  Chapter  2  contains  a  review  of  known  classical  results 
and  points  out  some  highlights  of  the  classical  theory  and  methods.  In  ^ 

Chapter  3,  new  results  concerning  classical  design  problems  are  developed. 

These  results  are  applied,  in  Chapter  4,  to  discuss  properties  of  various 
design  schemes.  With  this  background,  sufficient  motivation  is  provided  for 


'  -  *.  c ■  *  •  •  *  •  -  -  •  •  o  . i  -« '  -  -  v  o  %  v.  •  v  jy 
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developing  extensions  of  the  concepts  described  in  Chapter  2  to  multiple 
loop  feedback  systems. 

A  discussion  of  some  previously  obtained  extensions  of  classical 
concepts  is  contained  in  Chapter  5.  (The  discussion,  however,  is  limited  to 
include  only  those  extensions  which  are  directly  relevant  to  the  purposes 
of  this  thesis.)  As  many  authors  have  discussed,  the  fact  that  certain 
closed  loop  transfer  functions  directly  express  the  quality  of  a  feedback 
design  holds  for  multiple  loop  as  well  as  single  loop  systems.  (One 
difference  is  that  some  nonstandard  functions  must  be  considered  when 
multiple  loop  phenomena  having  no  single  loop  analogue  are  encountered; 
see  Chapter  5  for  references  to  these  results.)  The  focus  of  this  thesis 
is  on  the  other  two  concepts  highlighted  in  Chapter  2,  since  it  appears  more 
work  is  needed  in  these  areas.  These  two  concepts  are  the  relation  of  open 
to  closed  loop  system  properties  and  the  tradeoffs  between  system  properties 
in  different  frequency  ranges  imposed  by  the  Bode  integrals.  It  is  shown 
in  Chapter  5  that  many  previous  extensions  of  these  concepts  hold  only  in 
special  cases,  for  which  the  multiple  loop  design  problem  essentially 
reduces  to  a  single  loop  problem.  This  reduction  occurs  when  system 
properties  and  design  goals  are  essentially  the  same  in  all  loops  of  the 
system.  Problem  formulations  are  then  given  in  which  this  simplifying 
assumption  is  not  satisfied.  In  this  way  Chapter  5  also  describes  the 
class  of  design  problems  for  which  theory  is  to  be  developed  in  later 
chapters. 

As  stated  previously,  classical  design  emphasized  shaping  the 
gain  and  phase  of  the  open  loop  transfer  function.  Although  the 
generalization  of  gain  to  multiple  loop  systems  is  fairly  straightforward. 
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the  generalization  of  phase  is  not.  In  addition,  multiple  loop  systems 
exhibit  directionality,  a  property  with  no  scalar  analogue.  The  purpose 
of  Chapter  6  is  to  discuss  what  role,  if  any,  phase  could  play  in  multiple 
loop  systems,  and  the  relation  of  both  gain  and  phase  with  the  direction¬ 
ality  properties  of  these  systems.  Emphasis  in  Chapter  6  is  placed  upon 
physical  interpretations  of  the  various  parameters. 

Another  way  of  viewing  the  gain  and  phase  of  a  scalar  transfer 
function  is  as  a  set  of  coordinates  completely  specifying  the  value  of 
the  function.  In  Chapter  7  sets  of  coordinates  for  matrix  transfer 
functions  are  discussed.  These  include  the  singular  values  and  the  measures 
of  phase  difference  discussed  in  Chapter  6.  Some  relations  between  these 
coordinates  and  feedback  properties  are  developed. 

In  Chapter  8  extensions  of  one  of  the  important  classical  concepts 
are  presented.  This  concept  is  that  rules  of  thumb  are  available  for 
relating  open  loop  to  closed  loop  properties  of  feedback  systems.  The 
novel  results  show  that  rules  of  thumb  may  be  obtained  for  systems  with 
both  large  and  small  levels  of  gain  in  different  directions  at  the  same 
frequency.  As  might  be  expected,  feedback  properties  are  a  function  of  a 
certain  measure  of  the  coupling  between  high  and  low  gain  portions  of 
the  system.  In  addition,  frequency  regions  analogous  to  gain  crossover 
frequency  in  a  single  loop  system  are  shown  to  exist. 

For  two-input  two-output  systems  the  results  of  Chapter  8  are 
expressed  in  the  coordinates  discussed  in  Chapters  6  and  7.  Two  examples 
are  then  presented.  One  is  an  example  of  a  physical  system.  This  example 
verifies  that  the  results  of  Chapter  8  are  indeed  useful  in  analyzing 
systems  of  engineering  interest. 


The  other  example  in  Chapter  8  was  specially  constructed  to 
reveal  some  behavior  in  multiple  loop  systems  which  appears  to  have  no 
analogue  in  scalar  systems.  In  particular,  the  limitations  imposed  by  the 
Bode  gain-phase  relations  appear  to  be  violated.  It  is  postulated  that 
this  behavior  is  due  to  a  helpful  Interaction  between  the  two  loops  of 
the  system.  The  fact  that  no  theoretical  foundation  for  understanding 
such  phenomena  exists  provides  motivation  for  the  major  work  of  this  thesis, 
which  is  to  develop  such  a  foundation. 

Theoretical  understanding  of  multiple  loop  systems  is  the  topic 
of  Chapters  9-12.  Chapter  9  initiates  this  work  by  first  analyzing  how 
analytic  function  theory  was  used  to  derive  constraints  on  scalar  transfer 
functions.  Although  the  Bode  integral  relations  expressed  these  constraints 
in  the  form  most  useful  to  engineers,  the  information  is  also  contained  in 
the  more  familiar  (to  mathematicians)  Cauchy-Riemann  equations  and  Cauchy 
Integral  Theorem.  All  these  formulas  show  that  the  two  coordinates  needed 
to  describe  the  value  of  a  transfer  function  provide  essentially  only 
one  degree  of  freedom  to  be  manipulated  in  design. 

Obviously  in  multiple  loop  systems  the  same  equations  may  be 
applied  to  each  element  of  a  matrix  transfer  function  written  in  standard 
coordinates.  Unfortunately,  these  coordinates  do  not  generally  convey 
useful  information  about  the  quality  of  a  feedback  design.  As  shown  in 
Chapters  6-8,  other  coordinates  exist  which  do  contain  such  information. 

It  is  not  clear,  however,  how  the  constraints  imposed  by  analyticity 
manifest  themselves  in  these  new  coordinates.  Thus  it  is  necessary  to 
study  how  the  basic  equations  of  complex  variable  theory  transform  under 


changes  of  coordinates  on  matrix  transfer  functions.  After  this  is  under¬ 
stood,  it  will  then  be  possible  to  express  the  Bode  gain-phase  relations 
in  useful  nonstandard  coordinates. 

Since  a  simple  characterization  of  analyticity  is  given  by 
the  Cauchy- Riemann  partial  differential  equations,  study  of  these  is 
undertaken  first.  It  is  shown,  using  differential  equations,  that  knowledge 
of  a  singular  value  gives  information  about  some  property  of  the  associated 
pair  of  singular  vectors.  The  fact  that  the  exact  nature  of  this  information 
is  unclear  motivates  a  study  of  how  singular  vectors  change  with  frequency. 

In  Chapter  9  two  special  classes  of  singular  vectors  are  singled 
out  for  study.  The  first  class  consists  of  those  vectors  for  which  the 
singular  subspaces  are  constant,  thus  allowing  scalar  ideas  about  the 
relation  of  gain  and  phase  to  be  generalized  directly.  The  second  class 
consists  of  so-called  "minimum-energy"  singular  vectors.  Thought  experiments 
are  discussed  which  suggest  that  these  singular  vectors  exhibit  motion  only 
of  the  singular  subspaces;  such  motion,  in  turn,  is  responsible  for  system 
behavior  having  no  analogue  in  single  loop  systems . 

Although  differential  equations  describing  the  motion  of  singular 
vectors  are  developed  in  Chapter  9,  the  relation  of  a  singular  vector  pair 
to  its  associated  singular  value  remains  difficult  to  analyze.  To  alleviate 
this  difficulty,  in  Chapter  10  a  study  of  the  space  in  which  singular  vectors 
lie  is  undertaken.  By  showing  that  this  space  has  the  structure  of  a  circle 
bundle  over  complex  projective  space,  it  is  possible  to  separate  the 
direction  of  a  singular  vector  (i.e.,  the  one-dimensional  subspace  in  which 


it  lies)  from  any  given  measure  of  its  phase.  Moreover,  questions  concerning 
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Che  exisCence  of  a  general  definition  of  phase  may  be  settled.  It  is  shown 
that  a  measure  of  phase  difference  between  a  pair  of  left  and  right  singular 
vectors  may  be  obtained  from  measures  of  the  phase  of  each  member  of  the 
pair. 

In  Chapter  11  the  geometric  framework  developed  in  Chapter  10  for 
separating  phase-like  properties  of  singular  vectors  from  directionality 
properties  is  used  to  analyze  the  differential  equations  derived  in 
Chapter  9.  It  is  shown  that  the  tangent  space  to  the  unit  sphere  in  ®n 
may  be  decomposed  into  the  direct  sum  of  two  spaces.  Each  of  these  in  turn 
corresponds  to  one  of  the  two  classes  of  singular  vectors  singled  out  in 
Chapter  9  for  further  study.  This  decomposition  of  the  tangent  space 
satisfies  the  definition  of  a  connection  in  a  principal  fiber  bundle. 

Now,  although  this  particular  description  of  a  connection  arises 
naturally  from  the  problem  under  study,  it  still  does  not  seem  to  provide 
much  insight.  To  remedy  this,  an  equivalent  description  of  the  connection 
is  developed  in  Chapter  11.  This  description  does  provide  insight  into  the 
distinction  between  changes  in  the  singular  subspaces  and  changes  in  the 
phase  difference  between  a  pair  of  singular  vectors.  In  addition,  it  is 
much  more  useful  in  performing  computations. 

One  result  of  Chapter  11  is  the  derivation  of  some  differential 
equations,  involving  a  singular  value  and  a  measure  of  phase  difference 
between  its  associated  pair  of  singular  vectors,  which  are  analogous  to  the 


familiar  Cauchy-Riemann  equations.  It  turns  out,  moreover,  that  the  functions 
of  interest  generally  do  not  satisfy  these  familar  equations.  The  amount  by 
which  the  Cauchy-Riemann  equations  fail  to  be  satisfied  may  be  quantified 
by  the  difference  between  two  connection  one-forms  evaluated  on  tangent 
vectors  to  the  paths  through  complex  projective  space  which  the  singular 
vectors  are  constrained  to  traverse.  By  assigning  coordinates  to  projective 
space,  considerable  insight  is  gained  into  the  effects  of  nonconstant 
singular  subspaces. 

The  final  theoretical  results  of  this  thesis  are  contained  in 
Chapter  12.  In  this  chapter,  the  generalized  Cauchy-Riemann  equations 
developed  in  Chapters  9-11  are  used  in  conjunction  with  Stokes'  Theorem  to 
derive  a  generalization  of  the  Bode  gain-phase  relations  to  multiple- loop 
systems.  The  generalization  is  applicable  to  functions  of  the  form  ae^AS, 
where  o  is  a  singular  value  and  A9  is  an  appropriate  measure  of  phase 
difference  between  the  associated  pair  of  singular  vectors. 

The  generalized  gain-phase  relations  show  how  the  value  of  A9(ju) 
may  be  expressed  as  the  sum  of  two  integrals.  The  first  integral  is  a 
function  of  the  rate  at  which  the  singular  value  rolls  off  in  the  vicinity 
of  s  *  jur,  and  is  entirely  analogous  to  the  classical  Bode  integral  relating 
the  gain  and  phase  of  a  single  loop  transfer  function.  The  second  integral, 
on  the  other  hand,  has  no  analogue  in  single  loop  systems.  Interpretations 
of  this  term  are  also  presented.  These  stress  that  the  discrepancy  between 
the  actual  value  of  A9(jw)  and  the  value  of  the  first  integral  is  due  to 
phase  lead  or  lag  being  transferred  among  loops  of  the  system. 


Finally,  in  Chapter  13  the  new  integral  relation  developed  in 
Chapter  12  is  applied  to  analyze  the  example  presented  in  Section  8.4. 
was  pointed  out  in  the  original  discussion  of  this  example  that  its 
anomalous  behavior  appeared  to  be  due  to  a  transfer  of  phase  lead  from 
the  first  to  the  second  loop  of  the  system.  The  new  gain-phase  relation 
allows  this  conjecture  to  be,  if  not  verified,  at  least  made  quite 
plausible. 

Concluding  remarks  are  contained  in  Chapter  14. 


CHAPTER  2 


REVIEW  OF  CLASSICAL  FEEDBACK  CONCEPTS 

2.1.  Introduction  and  Motivation 

The  purpose  of  this  chapter  is  to  review  classical  feedback  theory 
for  linear  time-invariant  single-input  single-output  (SISO)  control  systems. 
The  review  is  not  intended  to  be  complete.  Rather  some  important  concepts 
will  be  highlighted  which  have  proved  particularly  useful  in  ascertaining 
design  limitations.  Extension  of  these  concepts  to  multiple-input  multiple- 
output  (MIMO)  systems  will  be  pursued  in  later  chapters  of  this  thesis. 

It  should  be  emphasized  that  the  purpose  of  this  chapter  is  not  to 
present  new  results.  (New  results  in  single  loop  feedback  theory  are  given 
in  Chapter  3.)  Rather,  in  this  chapter  interpretations  of  existing  results 
are  presented,  and  a  paradigm  is  seen  to  emerge.  Three  features  of  classical 
design  are  discussed  in  this  chapter;  a  summary  of  these  features  was 
presented  in  Chapter  1.  Additional  discussion  of  classical  design  techniques 
is  contained  in  Chapter  4. 

One  reason  to  carefully  reexamine  classical  theory  before  attempt¬ 
ing  MIMO  extensions  is  to  insure  that  one  is  actually  working  with  the  key 
concepts,  rather  than  concepts  whose  seeming  importance  is  only  illusory. 

To  illustrate,  recall  that  attempts  to  generalize  the  notion  of  gain  and 
phase  margins  to  MIMO  systems  met  with  difficulty.  Extending  the  notion  of 
the  closest  distance  from  the  Nyquist  locus  to  the  critical  point  proved  to 
be  straightforward,  however.  Further  examination,  (e.g.,  [1,  p.  1182]), 
showed  that  in  fact  gain  and  phase  margins  were  only  useful  in  assessing 
worst  case  robustness  insofar  as  they  approximated  this  closest  distance. 


Thus  the  difficulty  in  discovering  MIMO  gain  and  phase  margins  useful  in 
worst  case  analysis  is  perhaps  not  surprising;  it  might  be  considered  a 
fortuitous  accident  that  these  margins  are  even  useful  in  SISO  systems. 

Another  reason  to  discuss  SISO  systems  is  to  point  out  certain 
properties  of  these  systems  which  imposed  limitations  and  tradeoffs  in 
design.  Later  these  properties  will  be  examined  in  the  context  of  MIMO 
systems  to  see  if  these  limitations  are  still  present,  or  whether  the 
additional  degrees  of  freedom  available  in  MIMO  design  allow  :  iem  to  be 
overcome  or  at  least  allow  tradeoffs  to  be  made  in  a  different  manner. 


2.2.  Basic  Equations  for  Feedback  Systems 

Consider  a  linear  time-invariant  single-input  single-output 
feedback  system  as  shown  in  Figure  2.1. 

Here  P(s)  and  F(s)  are  the  transfer  functions  of  the  plant  model 
and  feedback  compensator,  respectively.  The  signal  r(s)  is  the  reference 
input,  y(s)  the  system  output,  d(s)  a  disturbance  input,  and  n(s)  is  sensor 
noise. 

The  output  of  the  system  is  given  by 

y(s)  -  yrCs)  +  yd(s)  +  yn(s)  (2. 

yf(s)  =  [1  +  P(s)F(s) P(s)r(s)  (2. 

yd(s)  =  [1  +  P(s)F(s) ]  1  d(s)  (2. 


y  (s)  =*  -[1  +  P(s)F(s)]P(s)F(s)n(s) 


(2. 
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Define  the  open  loop  transfer  function 


L(s)  =*  P Cs ) F Cs ) 


the  system  sensitivity  function 


S(s)  -  [1  +  L(s)]"1 


and  the  complementary  sensitivity  function  [  2] 


T(s)  *  [1  +  L(.s)  ]  L(s) 


(2.5) 


(2.6) 


(2.7) 


Assume  that  L(s)  is  free  of  unstable  hidden  modes.  Then  the 
feedback  system  is  stable  if  S(s)  is  bounded  in  the  closed  right  half  plane. 
Note  this  assumption  on  L(s)  implies  that  the  closed  right  half  plane  poles 
and  zeros  of  the  plant  and  compensator  must  appear  with  at  least  the  same 
multiplicity  as  that  in  L(s) . 

From  (2.3)  and  (2.6)  it  follows  that  the  response  of  the  system  to 
disturbance  inputs  is  determined  by  the  magnitude  of  the  sensitivity  function, 
|S(ju))|.  Similarly,  from  (2.4)  and  (2.7)  it  follows  that  the  response  of 
the  system  to  sensor  noise  is  determined  by  the  magnitude  of  the  complementary 
sensitivity  function,  | T( joi)  |  .  Therefore,  the  response  of  the  system  to 
disturbances  of  frequency  cu  can  be  made  small  by  requiring  that  | S (ju) ] <<1 ; 
the  response  to  sensor  noise  can  be  made  small  by  requiring  that  | T(jw) [ <<1 . 

The  benefits  of  feedback  in  reducing  the  effects  of  uncertainty 


in  the  plant  model  upon  the  system  output  can  be  assessed  from  [3] 


E  (s)  S  S ' (s)E  (s) 
c  0 


S ' (s)  -  [1  +P'(s)F(s)]“1 


(2.8) 


(2.9) 


■v.  ;:-.v 

I O  -V.  4.’  V 


16 


where  P'(s)  =  [1  +  A(s)]P(s)  is  the  true  plant.  The  signals  Ec(s)  and  Eq(s) 


are  the  deviations  in  the  outputs  of  nominally  equivalent  closed  and  open 
loop  systems  caused  by  the  model  error  A(s).  If  at  some  frequency 


S'(jo»)|<  1 


(2.10) 


then  the  closed  loop  system  is  said  to  possess  the  sensitivity  reduction 
property  at  that  frequency.  Condition  (2.10)  is  difficult  to  design  for 
since  it  depends  on  the  true  unknown  plant  rather  than  the  model. 

Nonetheless,  at  frequencies  for  which  |  A(ju>)  |  <  l,it  is  possible  [4]  to 
insure  that  (2.10)  holds  by  requiring  that  |S(ja>)j<<l.  Thus  it  is 
possible  to  reduce  the  effects  of  sufficiently  small  levels  of  plant 
uncertainty  by  requiring  the  nominal  sensitivity  function  to  have  small 
magnitude. 

The  sensitivity  and  complementary  sensitivity  functions  are  also 
related  to  the  stability  robustness  properties  of  the  system  (see,  e.g.,  [5], 
Table  1).  In  particular,  suppose  that  the  true  plant  is  given,  as  above,  by 


P'(s)  -  [1  +  A(s)]P(s) 


(2.11) 


Assume  that 

(i)  P'(s)  and  P(s)  have  the  same  number  of  unstable  poles  and 

(ii)  A(s)  is  otherwise  arbitrary  subject  to  a  frequency  dependent 
magnitude  bound: 


|  A(j a)  |  <  M,  (jj)  Vijj 


(2.12) 


Then  if  the  nominal  system  (1(3)  5  0)  is  stable,  the  true  system  is 
guaranteed  to  be  stable  if  [4] 


.  j 
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ma(u3)  <  iKjdi) 


Voi 


(2.13) 


This  bound  requires  that  | T ( j to)  |  must  be  small  at  frequencies  for  which 
uncertainty  may  be  large. 


2.3.  Closed  Loop  Transfer  Functions  and  Feedback  Properties 

The  preceding  discussion  has  highlighted  one  important  aspect  of 
classical  feedback  theory.  Properties  of  feedback  systems  which  are 
important  in  design  are  expressed  directly  by  various  closed  loop  transfer 
functions.  This  fact  motivates  the  statement  of  design  specifications  in 
terms  of  frequency  dependent  bounds  on  |S(ja))|  and  j  T  ( j  to)  |  : 

|S(j«o)|  <  Mg(uj)  Vw  (2.14) 

| T(jw)  |  <  M^oj)  Vu)  .  (2.15) 

These  bounds  will  generally  be  a  function  of  the  relative  levels  of 
disturbance  inputs,  sensor  noise,  and  plant  uncertainty.  Note,  however, 
that  both  |  S (J u»)  j  and  |T(jw)|  cannot  be  small  at  the  same  frequency. 

This  fact  follows  from  the  identity 

S  ( jtu)  +  T(joj)  =  1  (2.16) 

Thus  there  exists,  at  each  frequency,  an  algebraic  tradeoff  between  the 
system  properties  of  sensitivity  reduction  and  disturbance  rejection  (which 
are  achieved  by  requiring  ! S (jw) \ << 1)  and  the  properties  of  stability  robust¬ 
ness  and  sensor  noise  rejection  (achieved  by  requiring  | T( j u.=)  j  < <1)  . 
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In  engineering  applications  it  commonly  happens  that  levels  of 


uncertainty  and  sensor  noise  become  large  at  high  frequencies.  Disturbance 
rejection  and  nominal  sensitivity  reduction  are  generally  desired  over  a 
low  frequency  band  containing  the  frequency  range  of  the  reference  input. 
Thus  the  algebraic  tradeoff  imposed  by  (2.16)  is  typically  performed  by 
requiring  Mg(uj)  to  be  small  at  low  frequencies  and  to  be  small  at 

high  frequencies.  In  addition  neither  Mg  (tu)  nor  M^to)  should  be 
excessively  large  at  any  frequency. 


2.4.  Design  Limitations  Imposed  by  Bode's  Integral  Theorem 

There  exist  tradeoffs  in  feedback  design  in  addition  to  those  at 
each  frequency  imposed  by  (.2.16).  These  tradeoffs  must  be  performed 
between  system  properties  in  different  frequency  ranges,  and  are  a 
consequence  of  physical  realizability.  In  order  to  be  physically  realizable, 
a  function  of  the  complex  frequency  variable  "s"  must  in  fact  be 
analytic  in  this  variable;  this  follows  from  well-known  properties  of  the 
Laplace  transform.  Thus  ,  complex  variable  theory  can  be  applied  to  study 
properties  of  feedback  systems  expressed  by  both  open  and  closed  loop 
transfer  functions.  One  such  application  of  the  theory  is  the  Nvquist 
stability  criterion  [6] .  The  use  of  complex  variable  theory  to  study 
limitations  in  the  design  of  feedback  systems  was  pioneered  by  Bode  [7] 
after  some  early  work  by  Norbert  Wiener  and  students  [8].  Interpretations 
of  3ode's  work  in  the  context  of  control  design  were  emphasized  by  Horowitz 


The  tradeoffs  analyzed  by  Bode  are  quantified  by  certain  integral 
relations  which  tnust  be  satisfied  by  transfer  functions  of  linear  time 
invariant  systems.  One  such  tradeoff  occurs  among  levels  of  sensitivity 
reduction  in  different  frequency  ranges.  The  following  theorem  was  first 
proven  by  Bode;  an  extension  to  open  loop  unstable  systems  is  presented  in 
Chapter  3  of  this  thesis. 

Theorem  2.1  (Bode  Integral  Theorem  T7]):  Assume  that  the  open  loop  transfer 
function  L(s)  possesses  no  poles  in  the  open  right  half  plane.  In  addition, 
assume  that 

lim  sup  R|L(s)i  -  0  .  (2.17) 

R  ->«>  |  s  |  >  R 

Re[s]  >  0 

Then, If  the  closed  loop  system  is  stable, the  sensitivity  function  must 
satisfy 

oo 

0  =  /  log  I  S(jw)  I  dto  .  (2.18) 

0 

■ 

For  a  complete  discussion  of  the  design  implications  of  Theorem  2.1, 
see  Chapter  3.  For  now,  note  that  (2.18)  shows  that  on  a  plot  of 
log  |  S  (jtu)  |  vs  to  the  sensitivity  reduction  area  ( |  S ( jtu)  |  <1 )  must  equal  the 
area  of  sensitivity  increase  (|S(jw)|>l)  in  units  of  decibels  *  (radians/sec). 
Thus,  the  desirability  of  sensitivity  reduction  over,  say,  a  low  frequency 
range  must  be  traded  off  against  possibly  undesirable  effects  of  sensitivitv 
increase  at  other  frequencies.  When  realistic  bandwidth  constraints  are 
imposed  it  can  be  shown  (Chapter  3)  that  this  tradeoff  must  take  place 
primarily  over  a  low  frequency  interval,  despite  the  fact  that  the  range 
of  integration  in  (2.18)  is  infinite. 


Theorem  2.1  illustrates  another  important  theme  in  classical 
feedback  theory.  This  theme  is  that  there  exist  hard  limitations  on  the 
achievable  performance  of  any  linear  time-invariant  feedback  system.  In  the 
present  case, (2.18)  shows  that  if  L(s)  must  satisfy  (2.17),  then  not  all 
functions  S(s)  can  be  realized  as  the  sensitivity  function  of  the  closed 
loop  system.  In  Chapter  3  it  is  shown  that  relations  similar  to  (2.18) 
must  be  satisfied  by  S(s)  when  the  plant  is  nonminimum  phase  and  by  T(s) 
when  the  plant  is  open  loop  unstable.  In  all  these  cases  fixed  properties 
of  the  open  loop  system  restrict  the  achievable  properties  of  the  closed 
loop  system.  This  may  be  one  reason  why  classical  design  methods  focussed 
upon  shaping  the  gain  and  phase  of  the  open  loop  transfer  function  despite 
the  fact  that  it  is  the  closed  loop  transfer  function  which  directly 
expresses  the  quality  of  a  feedback  design.  (Further  discussion  of  this 
point  is  given  by  Horowitz  [9,  Sec.  6.13].  An  extensive  discussion  is  also 
presented  in  Chapter  4  of  this  thesis.) 


2.5.  Relations  Between  Open  and  Closed  Loop  System  Properties 

In  any  case,  certainly  one  reason  why  classical  loop  shaping  is 
successful  is  the  fact  that  the  relation  between  gain  and  phase  of  L(jw)  and 
closed  loop  properties  is  well  understood. 

From  (2.6)  and  (2.7)  the  following  relations  between  levels  of 


open  loop  gain  and  closed  loop  properties  can  be  deduced: 


|  L(joj)  j  »1 


(2.19) 


|sCjw)  |«1 

and 

| T (juj)  |  *=1 

and 

|T(jio)l«l 

|L(ju)|«l  and  (2.20) 

|S(jw)|*l 

At  frequencies  for  which  the  loop  gain  is  approximately  unity,  closed  loop 
properties  can  be  approximated  from  knowledge  of  open  loop  phase: 

|l(j»)|«i  |s(»|»i 

and  «■*  and  (2.21) 

*L(ju))*±  180°  |T(jw)|»l 

Approximations  (2.19)-(2.21)  yield  well-known  rules  of  thumb  for 
classical  design.  Large  loop  gain  is  required  for  good  sensitivity  reduction 
and  disturbance  rejection  properties,  while  small  loop  gain  is  required  for 
sensor  noise  rejection  and  stability  robustness.  At  frequencies  for  which 
|L(jw)|*,l  (e.g.,  gain  crossover  frequency,  at  which  |L(ju>)  |  *  1),  the  phase  of 
the  system  should  remain  bounded  sufficiently  far  away  from  ±180°  to  provide 
an  adequate  stability  margin  and  to  prevent  amplifying  both  disturbance 
inputs  and  sensor  noise. 

Recall  the  discussion  of  the  sensitivity  and  complementary  sensitiv¬ 
ity  specifications  (2.14)  and  (2.15).  From  that  description  of  typical  closed 
loop  design  specifications  and  the  approximations  (2. 19) -(2. 21)  it  follows  that 
corresponding  open  loop  gain  and  phase  design  specifications  might  appear  as  in 
Figure  2.2.  These  specifications  reflect  the  fact  that  loop  gains  are  desired 
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to  be  high  at  low  frequencies  for  good  system  performance  and  are  required  to 

be  small  at  high  frequencies  in  order  to  maintain  stability  robustness.  At 

intermediate  frequencies  the  phase  of  the  system  must  be  bounded  away  from 

±180°.  To  maintain  good  sensitivity  and  disturbance  rejection  properties  over 

as  wide  a  frequency  range  as  possible,  ujt  should  be  close  to  w_. 

L  H 


2.6.  Bode  Gain-Phase  Relations 

The  ability  to  achieve  the  specification  just  described  is 
constrained  by  the  fact  that  gain  and  phase  cannot  be  assigned  independently 
in  design.  The  precise  relation  between  gain  and  phase  was  stated  in  its 
most  useful  form  by  Bode  [7].  Again,  Horowitz  [9]  has  discussed  the 
implications  for  control  design.  A  good  recent  discussion  is  given  by 
Doyle  and  Stein  [4]. 

Theorem  2.2  (Bode  Gain-Phase  Relation):  Assume  that  L(s)  has  no  poles  or 
zeros  in  the  closed  right  half  plane;  in  addition,  assume  that  L(s)  has 
a  power  series  expansion  about  the  point  at  00 : 


oo  c 

L(s)  *  £  — 

i-0  s 


(2.22) 


Define 


v  ■  log(u)/w  ) 
o 


At  each  point  s  =  ju>  the  following  integral  relation  holds: 


*UJ“0)  -  £  /  d  {iog  coth  dv 


(2.23) 


(2.24) 


The  above  theorem  gives  conditions  which,  if  satisfied,  show  that 
knowledge  of  |L(juj)|  Vu  (which  implies  knowledge  of  —  1)  completely 

determines  *L(jto).  Thus,  although  two  parameters  or  coordinates  are  needed 
to  specify  the  value  of  a  transfer  function,  for  stable  minimum  phase  systems 
there  exists  only  one  degree  of  freedom  available  to  the  designer.  The 
presence  of  the  weighting  function  (Figure  2.3) 

til  +  us 


log  coth 


M  . 


log 


0)  -  0) 


(2.25) 


shows,  however,  that  *L(jajQ)  depends  primarily  on  values  of  — 
at  frequencies  near  oj^. 

The  implication  of  Theorem  2.2  for  the  design  specification  of 
Figure  2.2  is  as  follows.  If  the  gain  is  decreasing  at  20N  db/decade  at 
frequencies  near  o)q,  then  the  phase  is  approximately  -90N°.  Thus,  in 
Figure  2.2  u>L  must  be  sufficiently  less  than  so  that  the  rate  of  gain 
decrease  near  gain  crossover  frequency  is  not  much  more  than  20  db/decade, 
yielding  a  phase  lag  of  not  much  more  than  -90°.  Low  frequency  sensitivity 
reduction  must  be  traded  off  against  a  sufficient  stability  margin  near 
crossover.  This  is  another  example  of  a  tradeoff  which  must  be  performed 
among  system  properties  in  different  frequency  ranges.  Again,  the  tradeoff 
is  due  to  the  analyticity  of  transfer  functions  of  physical  systems. 

One  difference  between  this  tradeoff  and  that  discussed  in  Section  2.4  is 


that  the  latter  tradeoff  was  expressed  in  terms  of  a  closed  rather  than  an 
open  loop  transfer  function. 
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2.7.  Summary 


To  summarize  this  chapter,  several  key  concepts  from  classical 
feedback  theory  have  been  discussed.  First,  feedback  properties  can  be 
assessed  from  various  closed  loop  transfer  functions.  Second,  despite 
this  fact,  classical  design  focussed  on  open  loop  quantities;  this  procedure 
was  successful  since  the  approximate  relation  between  open  and  closed  loop 
quantities  is  well  understood.  Third,  properties  of  linear  time-invariant 
systems  constrain  the  class  of  achievable  design  specifications,  whether 
these  are  imposed  upon  open  or  closed  loop  quantities,  and  impose  tradeoffs 
in  feedback  design.  Recognition  of  the  existence  of  these  tradeoffs,  and 
the  ability  to  deal  insightfully  with  them  in  design,  are  crucial  to  the 
success  of  classical  feedback  theory.  Tradeoffs  can  be  seen  to  exist  from 
classical  Nyquist/Bode  analysis  techniques.  Well-known  rules  of  thumb 
associated  with  classical  lead  and  lag  compensation  schemes  show  that  trade¬ 
offs  cannot  be  avoided.  However,  these  techniques  are  useful  in  exploring 
the  set  of  possible  designs  and  achieving  a  satisfactory  compromise  among 
design  goals. 

In  the  next  chapter  some  new  results  concerning  tradeoffs  in 
SISO  systems  which  are  open  loop  unstable  and/or  nonminimum  phase  are 


presented. 


CHAPTER  3 


RIGHT  HALF  PLANE  POLES  AND  ZEROS  AND  DESIGN 
TRADEOFFS  IN  FEEDBACK  SYSTEMS 

3.1.  Introduction 

A  central  issue  in  the  design  of  feedback  systems  is  that  of 
sensitivity  of  the  closed  loop  system  to  uncertainty  in  the  plant  model 
and  to  disturbance  inputs.  The  system  sensitivity  function,  denoted  S(s), 
has  played  a  key  role  in  the  classical  design  and  theory  of  feedback  systems; 
its  importance  has  been  discussed  by  many  authors  [2-4,7,9-19].  Briefly,  the 
magnitude  of  the  sensitivity  function  evaluated  along  the  jw-axis  directly 
quantifies  such  feedback  properties  as  output  disturbance  rejection  and 
sensitivity  to  small  parameter  variations. 

Another  function  which  expresses  important  feedback  properties  is 
the  complementary  sensitivity  function  [2],  defined  as  T(s)  =  l-S(s).  The 
magnitude  of  T(s)  along  the  joj-axis  quantifies  the  response  of  the  feedback 
system  to  sensor  noise.  In  addition,  this  quantity  has  recently  been  used 
as  a  measure  of  the  stability  margin  [4,11-14,20]. 

The  importance  of  |T(ju>)|  and  |S(ju>)|  to  design  properties  motivates 
the  expression  of  design  limitations  imposed  by  the  open  loop  transfer  function 
directly  in  terms  of  these  quantities.  For  example,  a  well-known  theorem  of 
Bode  [7,9]  states  that  for  stable  open  loop  transfer  functions  with  greater 
than  one  pole  roll-off  the  integral  overall  frequencies  of  the  log  magnitude 
of  the  sensitivity  function  must  equal  zero.  In  the  presence  of  bandwidth 
limitations  this  imposes  a  design  tradeoff  among  system  sensitivity  properties 
in  different  frequency  ranges  ([9]  and  Section  3.3  below). 
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It  has  not  been  common,  however,  to  formulate  other  limitations 
explicitly  in  terms  of  {  S ( j ou)  J  and  | T ( j to >  [ .  For  example,  it  has  long  be; 
recognized  that  the  presence  of  right  half  plane  poles  and  zeros  in  the  open 
loop  transfer  function  imposes  limitations  upon  the  design  of  feedback  systems. 
These  limitations  are  frequently  expressed  in  terms  of  the  effect  on  the  phase 
of  the  open  loop  transfer  function.  Suppose  the  plant  is  nonminimum  phase. 

Then, using  classical  analysis  techniques.it  can  be  seen  qualitatively  that 
requiring  |S(jui)|  to  be  less  than  one  over  some  frequency  interval  implies  that 
|S(ju))  |  is  greater  than  one  elsewhere.  This  fact  is  proven  by  Francis  and 
Zames  [21,  Theorem  3].  These  authors  show  that  if  the  plant  has  a  right  half 
plane  zero,  then  requiring  |S(joi)|  to  be  arbitrarily  small  over  some  interval 
forces  |S(ju)|  to  be  arbitrarily  large  elsewhere. 

Despite  the  importance  of  |S(ju>)|  and  jT(jw)|  as  measures  of  design 
quality,  it  has  been  more  common  to  express  limitations  due  to  open  right  half 
plane  poles  and  zeros  as  constraining  the  values  of  S(s)  and  T(s)  at  isolated 
points  away  from  the  jui-axis  [2,15,16,20-22].  The  purpose  of  this  chapter  is  to 
present  equivalent  statements  of  the  right  half  plane  pole  and  zero  constraints 
in  terms  of  integral  relations  which  must  be  satisfied  by  |S(ju))|  and  |T(ju)|. 
These  constraints  show  that  desirable  properties  of  the  sensitivity  and 
complementary  sensitivity  functions  in  one  frequency  range  must  be  traded  off 
against  undesirable  properties  at  other  frequencies.  These  tradeoffs  are  a 
direct  consequence  of  properties  of  linear  time  invariant  systems.  Thus  the 
limitations  discussed  in  this  chapter  are  independent  of  any  particular 
choice  of  design  method. 
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The  remainder  of  this  chapter  is  organized  as  follows:  Section  3.2 
is  devoted  to  the  derivation  of  the  integral  relations  from  the  right  half 
plane  pole  and  zero  constraints.  In  Section  3.3  Bode's  integral  theorem, 
referred  to  above,  is  extended  to  open  loop  unstable  plants  and  consequences 
for  feedback  design  are  discussed.  Section  3.4  contains  a  discussion  of  the 
limitations  imposed  upon  system  sensitivity  properties  by  the  integral 
constraints  due  to  open  right  half  plane  zeros.  Section  3.5  contains  a 
similar  discussion  of  the  limitations  imposed  upon  the  complementary  sensitiv¬ 
ity  function  by  unstable  open  loop  poles.  The  effect  of  the  relative  location 
of  right  half  plane  poles  and  zeros  to  frequency  ranges  of  interest  is  discussed 
in  Section  3.6.  Some  brief  remarks  on  limitations  in  multivariable  systems  are 
found  in  Section  3.7.  The  chapter  is  summarized  in  Section  3.8.  Results  in 
this  chapter  have  appeared  in  [23-25]. 


3.2.  Right  Half  Plane  Pole  and  Zero  Constraints 

Consider  the  linear  time-invariant  feedback  system  of  Figure  3.1. 
Let  the  transfer  functions  of  the  plant  model  and  the  feedback  compensator 
be  denoted  P(s)  and  F(s),  respectively.  The  open  loop  transfer  function  is 
given  by 


L(s)  S  P(s)F(s) 


(3.1) 


The  sensitivity  function  of  this  system  is 


SCs) 


1  +  L(s) 


(3.2) 


.  r'r  '/ 

‘~.r~  V  -  A.  -  I 


. 


£ 


*  *1 

7*1 


*  nili  * 


and  Che  complementary  sensitivity  function  [2]  is 

T (s)  -  1-  S(s) 

.  Jd§L- 

1  +  L(s) 


(3.3) 


The  response  of  the  system  of  Figure  3.1  to  disturbance  inputs  is 

given  by 

yd(s)  *  S(s)d(s) 
and  the  response  to  sensor  noise  is  given  by 

yn(s)  =  -  T(s)n(s) 

From  these  equations  it  is  seen  that,  at  a  particular  frequency,  the  effect 
of  disturbances  can  be  reduced  by  requiring  that  |S(jw)|  <1  at  that  frequency. 
Similarly,  requiring  |  T ( j ui)  |  <1  leads  to  a  reduction  in  the  effects  of  sensor 
noise  at  that  frequency.  Since  S ( j to)  +  T ( j to)  =  1  there  is  a  well-known 
tradeoff  between  the  two  types  of  response  at  a  given  frequency.  The  integral 
relations  to  be  derived  in  this  section  and  Section  3.3  reveal  that  there  also 
exist  tradeoffs  among  feedback  properties  at  different  frequencies. 

Assume  that  L(s)  is  free  of  unstable  hidden  modes.  Then  the  feedback 
system  is  stable  if  S(s)  is  bounded  in  the  closed  right  half  plane.  Note  this 
assumption  on  L(s)  implies  that  the  closed  right  half  plane  poles  and  zeros  of 
the  plant  and  compensator  must  appear  with  at  least  the  same  multiplicity  in 
L(s)  . 


Assume  also  that  L(s)  can  be  factored  as 


(3.5) 


The  term  e 


,  x  >  0,  represents  a  time  delay  if  x  >  0. 
N 

2  Z.  -S 

b  (s)  =  n  z~- 

2  i=i  z^+s 


The  term 


is  the  Blaschke  product  of  open  right  half  plane  zeros,  including 
multiplicities : 


Jrm  { ;  i-i . nz> 


(3.6) 


Similarly, 


N 

p  p.-s 
B  (s)  =  n  err— 
p  i-i  Pi+S 


(3.7) 


is  the  Blaschke  product  of  open  right  half  plane  poles,  including 
multiplicities : 


P  -  ;  i=l,...,Np} 


(3.8) 


Finally,  L(s)  is  proper  and  has  no  poles  or  zeros  in  the  open  right  half  plane. 

For  later  reference,  consider  the  following  class  of  functions. 

Given  F(s),  define 


M(R)  -  sup  |F(Rej9)|  ,  6  €  [-xr/2  ,tt/2  ] 

9 


Then  F(s)  is  said  to  be  in  class  R  provided 


lim  f  M(R)  -  0 
R 


(3.9) 


Class  R  includes  many  functions  of  interest.  If  L  (s)  is  a  proper  rational 

d1 

function,  then  log  L  (s)  and  — 7  log  L  (s)  are  in  class  R.  Functions  of  the 

o  ,10 

ds 

form  log  L(s)  with  L(s)  ■  LQ(s)e  Sl,  x  >  0,  are  not.  If,  however,  the  feedback 


33 


system  with  sensitivity  function  (2.2)  is  stable,  then  log  S(s)  and  — -log  S(s) 

ds1 

are  in  class  R  despite  any  time  delay  in  L(s). 

From  (3.2)  and  (3.3)  it  is  clear  that  right  half  plane  poles  and 
zeros  of  L(s)  constrain  the  values  of  S(s)  and  T(s)  at  these  points  in  the 
right  half  plane  [15,16,20,22].  One  way  of  expressing  these  constraints, 
following  immediately  from  (3.2)  and  (3.3),  is  now  given. 

At  each  closed  right  half  plane  zero,  z,  of  multiplicity  m  it 
follows  that 


S(z)  =  1 


ds  s  =  z 


i-1 , . . . ,m-l 


Similarly,  at  each  closed  right  half  plane  pole,  p,  of  multiplicity  n  it 


follows  that 


T(p)  =  1 


ds  s  =  p 


0  i=l , . . . ,n-l 


Note  that  the  above  constraints  can  be  expressed  in  terms  of  either  S  or  T 
via  the  identity 


S (s)  +  T(s)  *  1 


(3.10) 


Assume  that  the  feedback  system  is  stable.  Then  S(s)  and  T(s) 
have  no  poles  in  the  closed  right  half  plane.  In  order  to  express  the 


constraints  (A)  due  to  open  right  half  plane  zeros  in  terms  of  jS(ju)|,it  is 
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necessary  to  remove  the  zeros  of  S(s)  at  the  open  right  half  plane  poles  of 
L(s) .  Note  the  sensitivity  function  can  be  factored  as 


S(s)  =  SCs)Bp(s) 


(3.11) 


where  S(s)  has  no  poles  or  zeros  in  the  open  right  half  plane.  Since  Blaschke 
products  are  all-pass  of  unit  magnitude  ( |  B^ ( j to)  |  *  1  Vu>),  it  follows  that 
|S(jaj)|  *  |S(jto)|  Vu.  The  constraints  upon  the  sensitivity  function  at  open 
right  half  plane  zeros  (A)  can  be  expressed  in  terms  of  the  sentivity  function 
on  the  jou-axis  as  follows. 

Theorem  3.1:  Let  z  =  x  +  jy  be  an  open  right  half  plane  zero,  with  multiplicity 

d1 

m,  of  the  open  loop  transfer  function  L(s).  Assume  that  — r  log  §(s)  is  in 

ds 

class  R,  i— ,1, . . . ,m-l.  Then,  if  the  corresponding  feedback  system  is  stable, 
the  sensitivity  function  must  satisfy  the  following  integral  constraints:* 


irlog  |  B~^  (z)  |  =/  log|S(ju))|  d0  (oj) 
P  z 

c  _  an 


(3.12)  ^ 


TT*  Bp(z)  =  /  *  S(jaj)  d02(w) 


,  i_  log  B-i(s)  . ;  i 

ds  s  =  z  -«  ds 


log  S(s) 


s  ■  joo 


(3.13) 


d0  (co)  (i=l, . . .  ,m-l)  .  (3.14) 

z 


The  function  9z(w)  is  given  by 


0  (oo)  *  arctan  [ — — 
z  x 


(3.15) 


Proof:  A  simple  application  of  Poisson's  Integral  Formulas  [26,27].  See 
Appendix  A  for  details.  ■ 


The  results  of  Theorems  3.1  and  3.3  remain  valid  even  if  L(s)  has 
ju-axis  poles.  Similarly,  Theorem  3.2  remains  valid  if  L(s)  has  jw-axis  zeros. 
This  is  discussed  further  in  Appendix  A. 


In  order  to  express  the  constraints  (B)  due  to  open  right  half 


plane  poles  in  terms  of  |T(jco)|  it  is  necessary  to  remove  any  zeros  of  T(s) 
at  the  open  right  half  plane  zeros  of  L(s)  as  well  as  any  time  delay.  Assume 
again  that  the  feedback  system  is  stable.  Then  the  complementary  sensitivity 
function  can  be  factored  as 


T(s)  -  T(s)B  (s)e 
z 


(3.16) 


where  T(s)  has  no  poles  or  zeros  in  the  open  right  half  plane.  The  fact  that 
Bz(s)  and  e  ST  are  all-pass  functions  of  unit  magnitude  implies  |T(jto)|  *  | T (j 
The  constraints  upon  the  complementary  sensitivity  function  at  open  right  half 
plane  poles  (B)  can  be  expressed  in  terms  of  this  function  on  the  jto-axis  as 
follows : 

Theorem  3.2:  Let  p  =  x+jy  be  an  open  right  half  plane  pole,  with  multiplicity 

d i 

n,  of  the  open  loop  transfer  function  L(s).  Assume  that  — r  log  T(s)  is  in 

ds1 

class  R,  i=0,l, . . . ,n-l.  Then,  if  the  corresponding  feedback  system  is  stable, 
the  complementary  sensitivity  function  must  satisfy  the  following  integral 
constraints : 


TTlog|B  1  (p)  I  +  ttxt  =  /  log |  T(jto)  |  de  (to) 

z  _  _  P 


TT*-  B  ^“(p)  +  Tfyt  =»  /  *  T(jto)  d6  (to) 

2  —  P 


TT-<L_.  (log  B  1(s)eST) 

ds1 


00 

=  /  — i  log  T(s) 

s  =  p  -  ®  ds 


i  S  =  j  to 


■  d9  (to) 
P 


(3.17) 

(3.18) 

(3.19) 


The  function  9  (to)  is  given  by 
P 


9^  (to)  *  arctan  ^  ] 


(3.20) 


Proof :  See  Appendix  A. 


Each  of  Che  integral  relations  of  Theorems  3.1  and  3.2  places  a 
constraint  upon  the  sensitivity  or  complementary  sensitivity  funciton.  For 
the  purposes  of  this  chapter  the  constraints  which  are  most  insightful  are 
those  given  by  (3.12)  and  (3.17).  These  constraints  give  the  area  under  the 
log|S(jui)|  and  log|T(jto)|  curves;  the  area  is  calculated  using  the  jw-axis 
weighted  by  the  location  of  a  right  half  plane  zero  or  pole,  respectively. 
The  weighting  function 

8  (oi)  -  arctan  ,  s*x  +  jy 

S  X 


is  shown  in  Figure  3.2. 

It  is  of  particular  significance  that  (3.12)  and  (3.17)  constrain 
the  integrals  of  log  |  S ( Joj)  |  and  log [  T( juj)  |  .  This  fact  implies  that  feedback 
properties  in  different  frequency  ranges  are  not  independent.  To  see  this, 
note  that  since 


d0  (oj) 
s 

do) 


x2+  (y-  w)2 


0 


it  follows  that  0g(u>)  is  an  increasing  function  of  u>.  Moreover,  the  terms  on 
the  left  hand  sides  of  equations  (3.12)  and  (3.17)  are  nonnegative.  These 
facts  reveal  that  systems  which  reduce  the  response  due  to  disturbances  or 
sensor  noise  (jS(ju>)|  <  1  or  |T(jw)|  <  1)  in  some  frequency  range  necessarily 
increase  this  response  at  other  frequencies.  Thus,  feedback  properties  at 
different  frequencies  must  be  traded  off  against  one  another  to  achieve  a 
satisfactory  design.  These  tradeoffs  will  be  discussed  further  in 


Sections  3. A  and  3.5. 


It  seems  intuitive  that  right  half  plane  poles  and  zeros  which 
are  close  to  frequency  ranges  over  which  design  objectives  are  given 
constitute  a  greater  obstacle  to  the  achievement  of  these  objectives  than 
if  these  poles  and  zeros  were  far  away.  The  weighting  function  appearing 
in  the  integral  relations  verifies  this  intuition  and  yields  a  precise  notion 
of  the  proximity  of  a  zero  or  pole.  The  relation  between  the  locations  of 
zeros  and  poles  and  the  corresponding  weightings  is  discussed  further  in 
Section  3.6. 

Finally,  note  that  the  weighted  length  of  the  ju>-axis  is  finite 
(and  equal  to  ir)  .  This  implies,  for  example,  that  it  is  not  possible  to 
trade  off  a  given  amount  of  sensitivity  reduction  by  allowing  |S(jw)|  to 
exceed  one  by  an  arbitrarily  small  amount  over  an  arbitrarily  large 
frequency  range.  The  amount  by  which  ]S(jco)|  exceeds  one  cannot  be  made 
arbitrarily  small.  The  significance  of  this  observation  will  become  clear 
in  Section  3.3,  where  an  extension  of  the  well-known  Bode  integral  theorem 
is  presented. 

3.3.  Generalization  of  Bode's  Integral  Theorem 

The  purpose  of  this  section  is  to  extend  a  well-known  theorem  of 
Bode  [7]  to  open  loop  unstable  systems.  Bode's  original  result  was  valid 
only  for  open  loop  stable  systems,  despite  a  claim  of  Horowitz  [9,  p.  307] 
to  the  contrary.  The  implications  of  this  result  for  feedback  design  have 
been  discussed  by  Horowtiz  [9]  and  others  [3,18]. 
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Bandwidth  constraints  in  feedback  design  typically  require  that  the 
loop  gain  be  small  above  a  specified  frequency.  In  addition,  it  is  frequently 
required  that  the  loop  gain  possess  greater  than  a  one  pole  roll-off  above 
tl  at  frequency.  These  constraints  commonly  arise  due  to  the  need  to  provide 
for  stability  robustness  despite  uncertainty  in  the  plant  model  at  high 
frequencies.  Bandwidth  constraints  also  arise  due  to  limitations  imposed  by 
actuators  and  sensors.  One  way  of  quantifying  such  constraints  is  by 
requiring  that 
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where  k  >  0  and  — <  m.  The  positive  value  of  k  insures  that  a  greater 
than  one  pole  roll-off  is  obtained  while  the  value  of  m  imposes  a  bound  on 
the  magnitude  of  the  loop  gain. 

Bandwidth  constraints  such  as  (3.21)  in  turn  impose  a  constraint 
upon  the  integral  of  the  log  magnitude  of  the  sensitivity  function.  First, 
the  requirement  that  the  loop  gain  have  greater  than  a  one  pole  rolloff 
yields  the  following  theorem: 

Theorem  3.3:  Assume  that  the  open  loop  transfer  function  L(s)  possesses 
finitely  many  open  right  half  plane  poles  {pi  :  i=l,...,Np}  including 
multiplicities.  In  addition,  assume  that 

lim  sup  R|L(s)|  =  0  .  (3.22) 

R-00  I S  |  >R 
Re[s ]>0 
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Then,  if  the  closed  loop  system  is  stable,  the  sensitivity  function  must 
satisfy 

N 

p  00 

n  Z  Re[p  ]  =  /  log |  S (ju>)  |  dcj  .  (3.23) 

i-1  0  ■ 

Proof:  See  Appendix  A.  ■ 

Note  that  Theorem  3.3  is  valid  for  systems  which  include  right 
half  plane  zeros  and  time  delays  in  L(s). 

If  N  *0,  then  Theorem  3.3  reduces  to  Bode's  theorem.  This 
P 

theorem  states  that  on  a  plot  of  log|S(jtu)|  versus  uj  the  sensitivity  reduction 
area  (log|S(jw)|  <  0)  must  equal  the  area  of  sensitivity  increase  (log |  S ( jco)  |  >  0) 
in  units  of  decibels  *  (radians/sec) . 

If  Np  >  0,  then  the  area  of  sensitivity  reduction  is  less  than  the 
area  of  sensitivity  increase  by  an  amount  proportional  to  the  sum  of  the 
distances  from  the  unstable  poles  to  the  imaginary  axis.  This  indicates  that 
a  portion  of  the  loop  gain  which  could  otherwise  contribute  to  sensitivity 
reduction  must  instead  be  used  to  pull  the  unstable  poles  into  the  left  half 
plane . 

By  itself,  Theorem  3.3  does  not  impose  a  meaningful  design 
limitation  since  the  necessary  area  of  sensitivity  increase  can  be  obtained 
by  allowing  |S(j<u)|  to  exceed  one  by  an  arbitrarily  small  amount  over  an 
arbitrarily  large  frequency  range.  (In  this  respect  Theorem  3.3  differs  from 
Theorems  3.1  and  3.2.)  However,  only  part  of  the  bandwidth  constraint  (3.21) 
was  used  to  obtain  Theorem  3.3;  namely,  the  fact  that  k>0  implies  (3.22)  is 
satisfied.  Practical  bandwidth  constraints  also  specify  the  value  of  m  in 
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(3.21).  Thus  (3.21)  implies  there  exists  a  frequency  u>  such  that  — i+k*  £  <  11 

wc 

It  follows  that 

| L(jco)  |  <  <  e  ,  a)>u>c  ,  (3.24) 

a) 

where  k>0.  This  property  of  the  open  loop  transfer  function  yields  the 
following  bound. 

Corollary:  Assume  that,  in  addition  to  (3.22),  the  transfer  function  L(s) 
satisfies  the  bound  (3.24).  Then 


^  log |  S(juj)  |  dui  <  log[^  *  £  ]  •(*>£ 


(3.25) 


Proof:  See  Appendix  A.  ■ 

The  bound  (3.25)  is  crude  and  in  fact  is  an  optimistic  estimate 
of  the  integral  in  question.  Nonetheless,  it  indicates  how  bandwidth 
constraints  which  limit  the  loop  gain  as  a  function  of  frequency  impose  a 
tradeoff  upon  system  sensitivity  properties.  Suppose  that  a  given  level  of 
sensitivity  reduction  is  desired  over  some  low  frequency  range.  Then  (3.25) 
places  an  upper  bound  on  the  area  of  sensitivity  increase  which  can  be 
obtained  at  frequencies  greater  that  uic  and,  therefore,  a  lower  bound  on  the 
area  of  sensitivity  increase  which  must  be  present  at  lower  frequencies.  This 
fact  can  be  used  to  obtain  a  lower  bound  (greater  than  one)  on  the  maximum 
value  of  sensitivity  increase  below  Note  that  the  bound  (3.25)  can  be 

increased  only  by  relaxing  the  bandwidth  specification.  In  practice  this  may 
not  be  possible  due  to  the  necessity  of  insuring  stability  robustness.  Thus 


a  tradeoff  is  imposed  among  system  sensitivity  properties  in  different 
frequency  ranges.  The  benefits  of  sensitivity  reduction  in  one  frequency 
range  must  be  obtained  at  the  cost  of  increased  sensitivity  at  other 
frequencies  whenever  bandwidth  constraints  are  imposed.  The  generalization 
of  Bode's  Theorem  presented  here  shows  that  this  cost  is  greater  for  open 
loop  unstable  systems. 


3.4.  Limitations  on  the  Sensitivity  Function  due  to  Open  Right  Half 
Plane  Zeros 

In  Section  3.2  it  was  shown  that  the  presence  of  open  right  half 
plane  zeros  places  constraints  upon  the  system  sensitivity  function.  These 
constraints  show  that  if  sensitivity  reduction  (|S(jw)|  <  1)  is  present  in  some 
frequency  range, then  there  necessarily  exist  other  frequencies  at  which  the  use 
of  feedback  increases  sensitivity  (|S(jw)|  >1).  The  purpose  of  this  section  is 
to  illustrate  this  requirement  by  deriving  some  lower  bounds  on  the  maximum 
amount  of  sensitivity  increase  given  that  a  certain  level  of  sensitivity 
reduction  has  been  achieved  over  some  frequency  range. 

For  a  given  plant  model  P(s)  suppose  it  is  desired  to  design  a 
feedback  compensator  F(s)  such  that  a  specified  level  of  sensitivity  reduction 
is  obtained  over  a  conjugate  symmetric*  range  of  frequencies  ft.  Let  the  desired 
level  of  sensitivity  reduction  be  given  by 

j  S(ju>)  |  <  a  <  1  Vco E  ft  (3.26) 

Let  z  be  an  open  right  half  plane  zero  of  L(s)  and  let  the  weighted  length 
of  the  frequency  range  ft  be  denoted 

it 

Conjugate  symmetry  implies  that  if  ft. 


then  —wuS  . 


(3.2 


0  (ft)  -  /  de  (u) 
z  a  z 

The  weighted  length  of  the  complementary  frequency  range  ft  =  {uj  :uj?ft} 
is  given  by 


0  (ftC)  -  ir  -  0  (ft) 
z  z 


(3.2 


From  Figure  3.3  it  is  clear  that  n  >  ©  (ft)  >  0  provided  that  ft^4>  and  ft?4®. 

z 

It  is  also  clear  that  if  ft,  C  ft  ,  then  0  (ft„)  >  0  (ft.). 

l  —  i.  zz-zl 

Define  the  maximum  sensitivity 


Js|.  "  SUP  |S(ju) 


(3.2 


UJ 


The  following  theorem  gives  a  lower  bound  on  the  maximum  sensitivity  due  to 
achievement  of  the  sensitivity  reduction  level  (3.26)  for  a  nonminimum  phase 
system.  As  this  lower  bound  is  greater  than  one,  it  follows  that  the  closed 
loop  system  exhibits  a  sensitivity  increase  over  some  frequency  range. 

Theorem  3.4:  Let  the  open  loop  transfer  function  L(s)  have  open  right  half 
plane  poles  and  zeros  given  by  (3.8)  and  (3.6).  Suppose  that  the  closed  loop 
system  is  stable  and  that  the  level  of  sensitivity  reduction  (3.26)  has  been 
achieved.  Then  for  each  the  following  bound  must  be  satisfied: 


e2<« 


sl- : 


1,  TT  -  0  (ft) 


.-1. 


Bp  (z) 


IT  -  ©  (ft) 
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(3.3l 


where  0  (ft)  is  given  by  (3.27). 
z 

Proof :  From  (3.12) 

rrlog i  B  1  (z)  |  *  /  log  i  S ( j uj)  |  d9  (w)  +  /  log]  S(joj)  !  d0  (u.) 


(3.3: 
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Since  sup|S(jw)|  <  a  by  design,  and  since  sup|S(ju>)j  <  JjSj^  by  definition, 
n  lc 

it  follows  that 


irlog  |  B”1  (z)  |  <  log  (a)  ©,,(1)  +  log| Sj ^  ©z(lC) 


(3.32) 


Exponentiating  both  sides  of  (3.32)  yields  the  result.  ■ 

In  general,  a  different  lower  bound  is  obtained  for  each  zero  (an 

exception  being  pairs  of  complex  conjugate  zeros) .  Note  that  a  similar  bound 

has  been  derived  by  Francis  and  Zames  [21,  Theorem  3].  The  bound  of  Theorem 

3.4  is  less  crude,  and  the  proof  and  results  are  more  insightful.  In  addition, 

although  (3.30)  is  derived  for  the  simple  specification  (3.26)  the  method 

readily  extends  to  more  general  specifications.  This  is  clear  from  the  proof. 

Inequalities  (3.30)  and  (3.32)  must  be  satisfied  for  each  open 

right  half  plane  zero  of  L(s) .  Before  discussing  the  tightness  of  these 

inequalities  the  significance  of  each  term  will  be  briefly  explained.  First, 

however,  note  the  facts  that  a<  1 , 1 B  (z)  I  >  1,  and  0  01)  <  tt  imply  that  the 

p  z 

right  hand  side  of  (3.30)  is  strictly  greater  than  one.  This  verifies  that 

the  maximum  sensitivity  is  indeed  greater  than  one. 

The  term  0  (1)  given  by  (3.27)  is  the  weighted  length  of  the 
z 

frequency  range  over  which  sensitivity  reduction  is  desired;  the  term 
0(1  )  given  by  (3.28)  is  the  weighted  length  of  the  complementary  frequency 
range.  The  relation  between  these  weighted  lengths  and  the  location  of  the 
zero  is  discussed  in  Section  3.6.  To  illustrate,  Figure  3.4  shows  how  the 
weighted  length  of  the  frequency  interval  1  =  [0,uo]  varies  as  a  function 
of  '^2  for  a  zero  at  s  =  1  and  at  s*l  +  j.  From  Figure  3.4  and  inequalities 
(3.30)  and  (3.32),  it  is  clear  that  a  significant  level  of  sensitivity  reduction 


at  frequencies  near  a  right  half  plane  zero  is  necessarily  accompanied  by  a 
large  sensitivity  increase  at  other  frequencies.  Moreover,  suppose  that 
sensitivity  reduction  is  desired  over  all  but  a  lightly  weighted  portion  of 
the  jw-axis.  Then  Theorem  3.4  shows  that  the  accompanying  sensitivity 
increase  must  be  greater  than  if  sensitivity  is  permitted  to  exceed  one  at 
more  heavily  weighted  frequencies.  The  above  comments  are  illustrated  in 
Figure  3.5  by  plotting  the  lower  bound  on  loglsj^  for  the  frequency  interval 
Q  -  [0,1  rad/sec],  zeros  at  s=l  and  s»l  + j,  and  various  levels  of  sensitivity 
reduction.  The  system  in  this  example  is  assumed  to  be  open  loop  stable. 

If  the  system  is  open  loop  unstable  then  the  lower  bound  (3.30)  on 
|S|ot  is  increased  as  a  function  of  the  proximity  of  the  unstable  poles  to 
the  zero  in  question.  This  is  a  consequence  of  the  fact  that  the  weighted 
area  under  the  log  j  S ( j co) |  curve  is  positive  for  open  loop  unstable  systems. 

Thus  the  presence  of  unstable  poles  in  the  open  loop  transfer  function  tends 
to  worsen  the  sensitivity  performance  of  the  closed  loop  system.  Since 


Trlog|Bp1(z) 


P  P,+z 

Z  log  — — 
.=1  Pi"Z 


(3.33) 


it  follows  (unsurprisingly)  that  systems  with  approximate  right  half  plane 
pole-zero  cancellations  can  have  especially  bad  sensitivity  properties. 

As  an  example,  the  magnitude  of  one  term  of  (3.33)  is  plotted  in  Figure  3.6 
versus  various  locations  of  a  real  pole  for  a  zero  at  s=  1.  The  effect  of 
an  unstable  pole  at  s  =*  2  upon  the  lower  bound  (3.30)  for  zeros  at  s  *  1  and 
s  *  1  +  j  and  various  levels  of  sensitivity  reduction  over  the  frequency 


Level  of  Sensitivity  Reduction  in  ft,  a 


Figure  3.5.  Lower  bound  on  maximum  sensitivity:  Open  loop  stable 


interval  fi  *  [0,1]  is  illustrated  in  Figure  3.7.  This  bound  should  be 
compared  with  that  for  an  open  loop  stable  system  plotted  in  Figure  3.5. 

In  general,  the  bounds  of  Theorem  3.4  will  not  be  tight  for  a 
variety  of  practical  as  well  as  theoretical  reasons.  Consider  the  bound 
(3.30)  for  a  single  zero.  From  (3.31)  it  follows  that  this  bound  is  satisfied 
with  equality  by  a  function  S(s)  for  which 


|  S(jw) 


to  £  0 


(3.34) 


where  fSj^  is  given  by  the  right  hand  side  of  (3.30).  The  function  S(s) 
is  illustrated  in  Figure  3.8  for  an  open  loop  stable  system  with  a  zero  at 
s  =  l  +  j  and  a  level  of  sensitivity  reduction  a*  .1  over  the  frequency  interval 
[0,1  rad/sec].  This  function  has  the  minimum  possible  value  of  maximum 
sensitivity  increase  of  all  functions  satisfying  the  sensitivity  specification 


(3.26) . 


As  mentioned  earlier,  a  sensitivity  function  with  a  gain  characteris¬ 


tic  as  in  (3.34)  and  Figure  3.8  is  in  practice  neither  achievable  nor  desirable 
It  is  not  achievable  due  to  the  corresponding  requirement  of  an  infinite 
bandwidth  open  loop  transfer  function,  nor  desirable  due  to  robustness  considera 
tions.  In  most  cases  robustness  constraints  will,  at  a  minimum,  require  that 


S(joj)  —  1 1  <  e  for  uj€  =  [uj^,00] 


(3.35) 


The  effect  of  the  constraint  (3.35)  may  be  analyzed  by  a  straightforward 
modification  of  Theorem  3.4.  As  e-*-0,  the  bound  (3.30)  is  replaced  by 


Lower  Bound  on  log  I SL 


Figure  3.7.  Effect  of  unstable  pole  on  maximum  sensitivity  bound 


log  IS(jco)l 
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Figure  3.8.  Sensitivity  function  achieving  equality  in  (3.30) 
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A  c 

where  =  Q  n  •  As  0^(8')  is  an  increasing  function  of  u)^»  it 

follows  that  for  fixed  values  of  z,  a,  and  u>2  the  minimum  possible  value 
of  | S| m  increases  as  decreases .  This  minimum  value  would  be  attained  by 
a  design  for  which  equality  is  achieved  in  (3.36).  The  gain  characteristic 
of  such  a  sensitivity  function  for  the  data  of  Figure  3.8  is  plotted  in 
Figure  3.9  for  various  values  of  tuy  Note  the  effect  of  the  weighting 
function  upon  the  value  of  jsj^  as  a  function  of  For  this  example  the 

increase  in  maximum  sensitivity  due  to  the  requirement  (3.35)  is  negligible 
for  sufficiently  large. 

Even  though  the  gain  characteristics  of  Figures  3.8  and  3.9  are 
discontinuous  they  may  be  approximated  arbitrarily  closely  by  stable  rational 
functions.  Thus,  the  bounds  (3.30)  or  (3.36)  would  be  tight  if  no  other 
constraints  on  sensitivity  were  present.  In  order  to  realize  these  sensitivity 
functions  by  applying  feedback  around  a  given  plant,  however,  integral  relations 

(3.12) -(3.14)  due  to  each  open  right  half  plane  zero  of  L(s)  must  be  satisfied. 
Thus,  the  sensitivity  function  constructed  as  in  Figure  3.8  or  3.9  to  satisfy 

(3.12)  for  one  zero  will  not,  in  general,  satisfy  integral  gain  relations  for 
other  zeros.  Recall  that  the  sensitivity  functions  of  Figures  3.8  and  3.9  are 
constructed  to  yield  the  minimum  possible  value  of  Jsl  for  the  given 
sensitivity  specification  under  the  constraint  imposed  by  a  single  zero. 

Thus,  the  implication  of  the  preceding  discussion  is  that  this  minimum 
possible  value  is  optimistic  and  the  lower  bounds  (3.30)  and  (3.36)  cannot 
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be  tight.  Finally,  the  practical  need  to  limit  complexity  of  the  compensator 
will  limit  the  ability  to  realize  piecewise  constant  functions  for  which  the 
bounds  are  tight. 

The  results  of  Theorem  3.4  can  be  useful  in  applications  by  allowing 
an  estimate  of  the  minimum  price,  in  terms  of  sensitivity  increase,  which 
must  be  paid  for  a  given  level  of  sensitivity  reduction  over  an  interval.  For 
example,  given  ft,  ft'  and  a  as  in  Figure  3.9,  the  minimum  possible  value  of 
J  Sj ^  can  be  computed  using  the  above  procedure  for  each  zero.  If  this  value 
is  too  large  for  any  zero  then  it  may  be  necessary  to  reduce  the  level  of 
sensitivity  reduction  a.  Alternatively,  the  locations  of  the  frequency 
intervals  ft  and  ft'  could  be  modified.  There  are,  of  course,  other  tradeoffs 
involved;  for  example,  increasing  the  system  bandwidth  may  not  be  permissible 
due  to  robustness  considerations. 

3.5.  Limitations  on  the  Complementary  Sensitivity  Function  Imposed  by 
Unstable  Poles 

In  Section  3.2  it  was  shown  that  the  presence  of  open  right  half 
plane  poles  places  constraints  upon  the  complementary  system  sensitivity 
function.  These  constraints  show  that  there  exists  a  tradeoff  among  system¬ 
sensor  noise-rejection  properties  in  different  frequency  ranges.  This 
tradeoff  can  be  thought  of  as  dual  to  that  imposed  upon  the  sensitivity 
function  by  right  half  plane  zeros. 

Lower  bounds  on  the  maximum  value  of  |T(joo)|  can  be  derived  which 
are  similar  to  those  of  Section  3.4.  Specifications  on  sensor  noise  response. 


analogous  to  (3.26)  are  usually  imposed  at  high  frequencies.  Assumption  of 
such  a  specification  leads  to  a  bound  similar  to  (3.30).  At  low  frequencies 
|T(ju))  |  is  usually  constrained  to  be  near  unity  by  the  requirement  of  small 
sensitivity.  This  fact  can  be  used  to  construct  a  lower  bound  similar  to 
(3.36). 

One  difference  between  the  results  of  Sections  3.4  and  3.5  is  that 
time  delays  worsen  the  tradeoff  upon  sensor  noise  reduction  imposed  by 
unstable  poles.  This  is  plausible  for  the  following  reasons.  The  use  of  feedback 
around  an  open  loop  unstable  system  is  necessary  to  achieve  stability.  Time 
delays,  as  well  as  right  half  plane  zeros,  impede  the  processing  of  information 
around  a  feedback  loop.  Thus,  it  is  reasonable  to  expect  that  limitations 
due  to  unstable  poles  are  worse  when  time  delays  and/or  right  half  plane  zeros 
are  present.  Note  in  particular  that  the  term  due  to  the  time  delay  in  (3.17) 
is  porportional  to  the  product  of  the  length  of  the  time  delay  and  the  distance 
from  the  unstable  pole  in  question  to  the  jw-axis.  This  is  consistent  with 
the  above  interpretation. 

It  should  also  be  noted  that  the  reciprocal  of  the  complementary 
sensitivity  function  has  been  interpreted  as  a  measure  of  system  stability 
margin  against  unstructured  multiplicative  uncertainty  [4,11-14,20].  Under 
this  interpretation,  Theorem  3.2  shows  that  unstable  poles  also  impose  a 
tradeoff  upon  the  size  of  this  measure  of  stability  margin  in  different 
frequency  ranges.  Thus,  this  stability  margin  cannot  be  large  at  all 
frequencies . 
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3.6.  Dependence  of  Weightings  upon  Pole/Zero  Location 

In  the  previous  sections  it  was  shown  that  the  weighted  length  of 
various  frequency  intervals  is  important  in  determining  the  tradeoffs  imposed 
by  right  half  plane  poles  and  zeros  via  the  integral  relations  of  Theorems  3.1 
and  3.2.  Intuitively,  the  difficulty  in  achieving  the  benefits  of  feedback  is 
a  function  of  the  proximity  of  such  zeros  and  poles  to  frequency  ranges  over 
which  design  specifications  are  imposed.  The  fact  that  weighting  functions 
appear  in  the  integral  relations  justifies  this  intuition  and  allows  the  notion 
of  proximity  to  be  made  precise. 

The  purpose  of  this  section  is  to  discuss  the  dependence  of  the 
weighting  assigned  to  a  frequency  interval  upon  the  relative  location  of  the 
pole  or  zero  and  the  interval.  This  dependence  can  be  seen  qualitatively  from 
Figure  3.3.  The  quantitative  analysis  in  this  section  should  prove  useful  in 
constructing  design  specifications  which  reflect  the  tradeoff  between  benefits 
and  cost  of  feedback  imposed  by  right  half  plane  poles  and  zeros.  The  discus¬ 
sion  of  this  section  uses  weightings  imposed  by  right  half  plane  zeros,  but 
identical  results  hold  for  weightings  imposed  by  right  half  plane  poles. 

For  purposes  of  illustration,  consider  the  frequency  interval 

3  *  [oo^  ,ol>2 ]  U  [-a^.-to^]  .  (3.37) 

Then  (3.37)  yields 


0  (fi)  =  9  ( co  ^ )  -  0  (to.)  +  0  (-to. )  -  0  (~!00) 
z  z  l  zl  zl  z  2 


(3.38) 


From  Figure  3.3  and  Equation  (3.38)  it  is  obvious  that  ©  (ft)  is  a  monotonically 

z 

increasing  function  of  Aw  =  For  fixed  values  of  w^  and  z,  a  simple 

calculation  reveals  that 
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(3.39) 


where  (see  Figure  3.2) 

r  (ft)  -  /x^  +  (y-w)^  ,  s  =  x+jy  .  (3.40) 

s 

Equation  (3.39)  confirms  that  the  severity  of  the  tradeoff  imposed  by  the 
integral  relation  (3.12)  and  estimated  by  the  bound  (3.30)  becomes  increasingly 
worse  as  the  length  of  the  frequency  interval  over  which  a  given  level  of 
sensitivity  reduction  is  desired  is  increased.  For  a  real  zero,  the  two  terms 
on  the  right  hand  side  of  (3.39)  are  equal  in  magnitude  and  are  monotonically 
decreasing  functions  of  Aw.  For  a  complex  zero  in  the  upper  half  plane  the 
first  term  dominates,  is  an  increasing  function  of  Aw  until  rz(w^ + Aw) =  Im[z] , 
and  decreases  thereafter  (Figure  3.3).  The  second  term  is  monotonically 
decreasing.  These  observations  indicate  that  the  greatest  incremental  degrada¬ 
tion  in  performance  due  to  an  incremental  increase  in  the  length  of  an  interval 
of  sensitivity  reduction  occurs  for  values  of  Aw  such  that  is  in  the  vicinity 
of  y=Im[z].  This  is  verified  for  the  frequency  interval  w^  =  0,  Aw  *  ^  and 
zeros  at  s=  .1  and  sa  ,1  +  j  in  Figure  3.10. 

Another  interesting  result  is  obtained  by  fixing  the  length  of  the 
frequency  interval  and  varying  the  location  of  the  interval  relative  to  the 
zero.  If  Aw <  |z|,  then  from  Figure  3.3  it  follows  that  as  w^  is  increased 


from  0  to  00  the  weighted  length  of  the  frequency  interval  increases  and 
then  decreases.  This  is  verified  quantitatively  from 
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a  real  2ero  the  two  bracketed  terms  in  (3.41)  are  equal  and 
For  a  complex  zero  in  the  upper  half  plane  the  second  term  is 
The  first  term  is  monotonically  decreasing  and  equal  to  zero  for 
Together  the  contributions  of  the  two  terms  indicate  that  the 
weighted  length  of  the  frequency  interval  reaches  a  maximum  when  the  midpoint 
of  the  frequency  interval  is  somewhat  less  than  the  imaginary  component  of  the 
zero,  with  corresponding  effect  on  the  difficulty  of  achieving  suitable 
sensitivity  performance.  This  is  illustrated  in  Figure  3.11  for  a  frequency 
interval  ft  =  [o)^,oj^+1]  and  zeros  at  s  =  1  and  s^l+lOj. 

It  is  also  interesting  to  consider  the  effect  of  varying  the  location 
of  the  zero  relative  to  a  fixed  frequency  interval.  Of  course,  plant  zeros 
cannot  be  varied  in  practice;  however,  this  analysis  can  provide  infromation  as 
to  which  of  several  zeros  causes  the  most  difficulty  in  design. 

A  simple  calcualtion  shows  that,  for  z=x+jy. 
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For  a  real  zero, (3.42)  reduces  to 


3®_(0)  2(w„-a), )  C-x  +u,u)„) 


From  Figure  3.3  it  is  clear  that  as  x  is  increased  from  zero, 

0  (ft)  first  increases  and  then  decreases.  Equation  (3.43)  reveals  that  the 
z 

maximum  value  of  0  (ft)  is  achieved  for  x  =  /w-w- .  For  a  complex  zero  in  the 

z  1  L 

upper  half  plane,  each  of  the  two  terms  in  (3.42)  increases  and  then  decreases 
as  x  is  increased  from  zero.  The  first  term  is  zero  at  x  =  / (y-uj^)  (y-u^)  \  the 
second  term  is  zero  at  x  =  / (y+oo^)  (y+u^)  .  Thus,  the  maximum  value  of  ©z(ft) 
is  achieved  at  some  intermediate  value  of  x,  which  could  be  determined  explicit 
ly  from  (3.42).  These  results  are  illustrated  in  Figure  3.12  for  the  frequency 
interval  ft  »  [0.1,5]  and  zeros  at  s  *  x  and  s  =  x+10j. 

Another  simple  calculation  shows  that 
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As  y  is  increased  from  0,  the  second  term  in  (3.44)  is  negative;  the  first 

<^+^2 

term  is  monotonically  decreasing  and  is  zero  at  y  =  : — - — .  Note  that  this 


result  is  consistent  with  (3.41). 


The  purpose  of  this  section  is  to  briefly  comment  on  the  constraints 
that  right  half  plane  poles  and  zeros  of  a  matrix  open  loop  transfer  function 
impose  upon  the  corresponding  matrix  sensitivity  and  complementary  sensitivity 
functions.  Although  the  situation  is  more  complicated  than  for  single  loop 
systems, some  useful  results  can  be  obtained  using  the  results  of  this  paper. 
Again,  the  results  are  illustrated  for  the  sensitivity  function  although 
analogous  statements  can  be  made  about  the  complementary  sensitivity  function. 

Let  L(s)  €  <jn  n  be  a  matrix  of  transfer  functions  and  assume  that 
the  feedback  system  whose  sensitivity  function  is  the  matrix  S(s)  *  [I  +  L(s)]  ^ 
is  stable.  A  commonly  used  measure  of  system  sensitivity  reduction  [4,11-14] 
is  the  largest  singular  value,  or  matrix  two-norm,  of  the  sensitivity  matrix. 
The  results  of  this  paper  cannot,  in  general,  be  applied  to  singular  values. 
Consider  instead  functions  of  the  form 

S  (s)  -  vHS(s)u  ,  (3.45 

VU 

where  u  and  v  are  constant  unit  vectors  in  ®n.  The  response  of  the  system  to 
disturbances  entering  in  the  direction  spanned  by  u  is  given  by  S(s)u.  The 
function  S^Cs)  is  the  component  of  this  disturbance  response  appearing  in 
the  output  direction  spanned  by  v.  Each  function  of  the  form  (3.45)  has 
the  property  that 


|  vHS  ( jw)  u  |  <  5  [SO)]  =  j]  S(  joj)  |  2 


(3.46 
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Thus,  to  insure  good  sensitivity  reduction  properties  at  a  given  frequency, 

it  is  necessary  (but  not  sufficient)  to  insure  that  |v  S(jw)u|  is  small  at 

that  frequency  for  a  particular  choice  of  u  and  v.  For  stable  feedback 

systems,  functions  of  the  form  (3.45)  are  analytic  and  bounded  in  the  closed 

right  half  plane.  Hence  the  results  of  this  paper  can  be  used  to  study  the 

effect  of  right  half  plane  transmission  zeros  upon  the  magnitude  of 

|v  S(joi)u|  and  thus  to  indirectly  study  o[S(joj)]. 

If  S^Cs)  has  zeros  in  the  right  half  plane,  then  these  are  either 

isolated  or  S  (s)  =0.  The  latter  case  is  trivial.  If  the  zeros  are  isolated 
vu 

then  they  can  be  factored  out  using  a  Blaschke  product  as  in  (3.11)  to  form  a 
function 

§VU(S)  *  SVU(S)  B~1(S)  *  <3.47) 

which  has  no  right  half  plane  zeros  and  for  which  |Svu(jw)|  =  |S  (jw)|. 

One  difference  between  the  function  Svu(s)  and  a  scalar  sensitivity 
function  is  that  Svu(s)  can  possess  right  half  plane  zeros  which  are  not  due 
to  unstable  poles  of  L(s).  For  example,  let 
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Then  the  function  Svu(s)  corresponding  to  (3.48)  has  a  zero  at  s=8.  This 
type  of  zero  does  not  necessarily  appear  as  a  consequence  of  internal  stability 
as  do  zeros  at  the  poles  of  P(s)  and  F(s).  Nonetheless,  zeros  of  this  type 
can  be  present  in  a  given  design  and  will  be  seen  to  worsen  tradeoffs  due  to 
the  transmission  zeros  of  L(s). 

If  L(s)  has  a  transmission  zero  at  s  =  z,  then  it  is  easily  verified 
that  S(z)  has  an  eigenvalue  equal  to  one.  Let  w  be  a  unit  magnitude  right 
eigenvector  of  S(z)  corresponding  to  this  eigenvalue.  Then  the  unit  vector  u 
may  be  written  as 

u  =  aw  +  Sw^  (3.49) 

2  2 

where  a  and  B  are  complex  scalars  with  |a|  +  |s|  ■  1,  and  w^  is  a  unit 
vector  orthogonal  to  w.  Then  the  magnitude  of  (3.45)  at  z  is 

| Svu(z) |  =■  |avHw  +  BvHS(z)wi |  .  (3.50) 

Lemma  A.l  can  be  applied  to  show  that 

00 

:rlog  | av\  +  8vHS(z)w^  I  +  7rlog|B  1(z)  j  =  /  log  |  Svu( ju>)  | d6  (w)  .  (3.51) 

_  00 

If  the  input  direction  u  is  equal  to  the  right  eigenvector  w,  then 
(3.51)  reduces  to 

00 

7Tlog|vHw|  +  Trlog  |  B  1(z)  I  =  /  logjs^  (jw)|d9  (w)  .  (3.52) 

—  00 

This  equation  shows  how  the  presence  of  a  right  half  plane  zero  constrains 
the  magnitude  of  v  S(joj)w  provided  that  v,  the  output  direction  of  interest. 


has  a  nonzero  component  in  the  direction  of  the  eigenvector  w.  If  in  fact 

jj 

v»w,  then  v  w«l  and  (3.52)  is  similar  to  the  constraint  (3.12).  If  v  and  w 

JJ 

are  orthogonal,  however,  then  v  S(s)w  has  a  zero  at  s*z.  If  this  zero  is 
isolated  then  it  can  be  removed  via  the  Blaschke  product;  the  value  of  the 
resulting  function  §^(3)  is  not  constrained  at  s*z.  Otherwise,  v  S(s)w  =  0. 
In  either  case  there  is  no  constraint  imposed  upon  the  weighted  integral  of 
loglS^jw)  |  .  If  0  <  |vHw|  <  1,  then  a  constraint  is  imposed  whose  severity 
decreases  as  |v^w|  decreases. 

The  purpose  of  the  above  discussion  is  merely  to  show  how  some 
information  about  the  effects  of  transmission  zeros  on  closed  loop  sensitivity 
can  be  obtained  and  is  not  intended  to  be  complete.  For  example,  results  can 
also  be  obtained  using  left  eigenvectors  of  S(z).  It  should  be  pointed  out 
that  the  results  of  this  section  are  consistent  with  those  of  Wall,  Doyle  and 
Harvey  [19],  who  discuss  multivariable  right  half  plane  zeros  in  terms  of  the 
extra  phase  lag  they  produce  in  certain  directions. 


3.8.  Summary 

In  this  chapter,  limitations  on  feedback,  design  due  to  right  half 
plane  poles  and  zeros  of  the  open  loop  transfer  function  have  been  discussed. 
These  limitations  are  expressed  directly  in  terms  of  the  magnitude  of  the 
sensitivity  and  complementary  sensitivity  functions  evaluated  along  the 
jw-axis.  This  form  of  expressing  the  limitations  should  prove  useful  in  that 
!  S ( j uj)  [  and  | T ( j to)  J  are  directly  related  to  the  quality  of  a  feedback  design. 
The  Bode  integral  theorem  has  also  been  extended  to  the  case  of  open  loop 
unstable  systems. 


The  limitations  of  Theorems  3. 1-3. 3  were  interpreted  as  imposing 
tradeoffs  among  system  properties  in  different  frequency  ranges.  It  should 
be  pointed  out  that  the  origin  of  these  tradeoffs  is  physical  realizability. 
The  property  of  realizability  relevant  to  the  present  context  is  the  fact 
that  the  Laplace  transform  of  the  impulse  response  of  a  physical  system  is  a 
locally  analytic  function  of  the  complex  frequency  variable.  Thus  the  origin 
as  well  as  the  implications  of  the  tradeoffs  discussed  above  are  significantly 
different  from  the  well-known  tradeoff  between  performance  and  robustness  at 
a  single  frequency  imposed  by  the  algebraic  relation  S(s)  +  T(s)  =  1. 
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CHAPTER  4 

COMMENTS  ON  DIRECT  DESIGN  USING 
CLOSED  LOOP  TRANSFER  FUNCTIONS 
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4.1.  Introduction 

As  a  prologue  to  attempting  generalizations  of  classical  single 
loop  feedback  theory  to  multiple  loop  systems,  it  is  at  least  interesting  to 
speculate  upon  why  the  existing  theory  developed  as  it  did.  One  motivation  for 
such  speculation  is  that  it  might  provide  insight  into  how  classical  theory  has 
avoided  various  pitfalls.  This  insight,  in  turn,  might  hopefully  provide 
guidance  into  how  these  same  pitfalls  may  be  avoided  in  extensions  to  multiple 
loop  systems.  Of  particular  interest  in  this  chapter  will  be  the  limitations 
on  feedback  properties  quantified  by  the  Bode  integral  relations  discussed  in 
Chapter  2  and  the  integral  constraints  due  to  right  half  plane  poles  and  zeros 
discussed  in  Chapter  3.  Recall  that  the  ability  to  cope  with  these  limitations 
and  the  tradeoffs  they  impose  was  the  third  important  feature  of  classical 
methods  pointed  out  in  Chapter  2. 

Return  now  to  the  first  two  features  of  classical  methods  discussed 
in  Chapter  2.  First  it  was  observed  that  certain  closed  loop  transfer 
functions  evaluated  along  the  ju-axis  directly  express  the  performance  and 
robustness  properties  of  a  feedback  system  (a  fact  which  has  been  noted  by 
many  authors  for  many  years) .  Two  transfer  functions  of  particular  interest 
are  the  sensitivity  function  and  complementary  sensitivity  function,  denoted 
S(s)  and  T(s)  =  l-S(s),  respectively.  Indeed,  the  values  of  |S(ju)  |  and 
| T(jco)  |  may  reliably  be  used  to  judge  the  quality  of  a  given  feedback  design. 
This  fact,  in  turn,  implies  it  is  also  natural  to  state  design  goals  directly 
as  bounds  upon  |S(ju>)|  and  |T(ja>)|. 
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Despite  the  useful  properties  of  closed  loop  transfer  functions, 
classical  control  techniques  in  fact  proceeded  by  shaping  the  open  loop 
transfer  function,  L(s)  =  P(s)F(s),  where  P(s)  and  F(s)  are  the  plant  and 
compensator  transfer  functions,  respectively.  That  feedback  design  via 
open  loop  shaping  is  feasible  is  no  doubt  due  to  a  second  important  feature 
of  classical  methods.  This  is  the  existence  of  rules  of  thumb  for  relating 
open  and  closed  loop  system  properties. 

At  first  it  might  seem  odd  that  classical  techniques  focussed 
directly  upon  open  loop,  rather  than  closed  loop,  transfer  functions.  A 
little  thought,  however,  suggests  one  (at  least  plausible)  reason  why  this 
might  have  advantages.  This  reason  is  that  the  effect  of  various  limitations 
imposed  by  properties  of  the  open  loop  transfer  function  might  be  dealt  with 
more  readily,  both  in  analysis  and  design,  by  methods  employing  open  loop 
shaping.  One  might  in  fact  conjecture  that  a  tradeoff  exists  between  design 
methods  employing  open  loop  transfer  functions,  in  which  design  limitations 
are  seen  directly  but  feedback  properties  can  only  be  assessed  indirectly, 
and  methods  employing  closed  loop  transfer  functions,  in  which  feedback 
properties  may  be  directly  assessed  but  plant  limitations  are  not  so  readily 
seen. 

The  purpose  of  this  chapter,  then,  is  to  investigate  whether  design 
techniques  employing  the  open  loop  transfer  function  indeed  do  have  any 
significant  advantages  over  techniques  directly  employing  closed  loop  transfer 
functions.  Emphasis  will  be  placed  upon  situations  in  which  the  class  of 
achievable  closed  loop  transfer  functions  is  constrained  by  properties 
necessarily  possessed  by  the  open  loop  transfer  function. 


Many  design  problems  could  be  used  as  a  basis  for  the  comparison 
undertaken  in  this  chapter.  The  problem  which  is  most  convenient  for  the 
present  purpose,  due  to  the  wealth  of  available  literature,  is  the  design 
of  a  system  whose  sensitivity  function  is  to  satisfy  certain  design  goals 
despite  limitations  imposed  by  open  loop  system  properties. 

Finally,  to  return  to  the  concerns  voiced  in  the  first  paragraph 
of  this  section,  suppose  it  is  indeed  true  that  open-loop  based  design 
techniques  have  important  advantages  in  the  design  of  single  loop  feedback 
systems.  Then  this  might  suggest  that  attempts  to  generalize  closed-loop 
based  design  techniques  to  multiple  loop  feedback  systems  might  not  be  as 
appropriate  as  attempts  to  generalize  open-loop  based  techniques.  Even  if 
this  conclusion  is  judged  too  strongly,  it  would  nevertheless  be  of  much 
interest  to  have  knowledge  of  what  information  and  insights  are  lost  when 
open- loop  techniques  are  abandoned.  If  this  information  and  insight  are 
indeed  critical  to  the  completion  of  a  satisfactory  design,  then  presumably 
they  would  have  to  be  obtained  from  alternate  sources  and  incorporated  in 
any  candidate  design  technique. 

Portions  of  this  chapter  have  appeared  in  preliminary  form  in  [28] . 


^.2.  The  Sensitivity  Design  Problem 

The  purpose  of  this  section  is  to  discuss  the  design  of  a  single 


loop  feedback  system  whose  sensitivity  function  is  to  satisfy  certain  design 
goals  under  bandwidth  constraints  imposed  by  properties  of  the  plant  and/or 
open  loop  transfer  function.  (Constraints  due  to  nonminimum  phase  plants 


are  studied  in  Section  4.3.)  It  will  be  argued  that  realistic  bandwidth 
constraints  place  nontrivial  limitations  upon  the  class  of  achievable 
sensitivity  functions,  that  design  methods  directly  employing  the  sensitivity 
function  must  face  the  problem  of  insuring  that  an  admissible  compensator 
will  be  obtained,  and  that  design  techniques  employing  the  open  loop  transfer 
function  may  circumvent  this  difficulty  by  considering  only  admissible 
compensators  from  the  outset. 

The  sensitivity  function  of  a  single-loop  feedback  system  is  given 
by  S(s)  =  [1  +  P(s)F(s)]  .  This  function  is  important  in  that  |S(ju)|  is  a 

measure  of  system  response  to  disturbance  inputs  and  of  sensitivity  to  small 
variations  in  the  plant  parameters  from  their  nominal  values. 

For  purposes  of  discussion,  consider  a  design  specification  requiring 
sensitivity  reduction  and  disturbance  rejection  (i.e.,  requiring  1 S(jw) { <<1) 
over  some  low  frequency  range  while  also  requiring  that  sensitivity  not  be 
excessively  large  at  any  frequency.  This  goal  may  be  formulated  quantitatively 
as  imposing  a  frequency-dependent  bound  on  the  magnitude  of  the  sensitivity 
function: 


(  S  ( j oj)  |  <  M(uj)  Vw 


(4.1) 


MgO-) 

Mg  (u)) 


^  S  [0,^] 

UJ  >  !±)^ 


(4.1a) 


(4.1b) 


Design  constraints  such  as  (4.1)  are  present  in  most  practical 
problems,  whether  stated  explicitly  or  only  implicitly.  As  discussed  in 
Chapters  2  and  3,  it  may  well  be  that  the  bound  (4.1)  cannot  be  satisfied 
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due  to  some  limitation  imposed  by  properties  of  the  open  loop  transfer  function. 
In  such  cases  it  is  relevant  to  ask  whether  it  is  possible  to  modify  M(w) 
appropriately  so  that  a  specification  which  is  achievable  and  which  also  admits 
satisfactory  designs  is  obtained.  Toward  this  end,  note  that  typically  the 
value  of  jS(joj) [  is  more  critical  to  system  performance  at  some  frequencies 
than  at  others,  and  that  satisfaction  of  design  constraints  is  certainly  more 
important  at  the  more  critical  frequencies .  Again,  for  the  purpose  of  discus¬ 
sion,  it  will  be  assumed  that  the  low  frequency  constraint  Mg(oo)  is  a  hard 
constraint  to  be  satisfied,  if  at  all  possible,  in  ord^r  to  achieve  good 
system  performance.  The  high  frequency  constraint  Ms(co),  on  the  other  hand, 
will  be  regarded  as  a  soft  constraint  whose  satisfaction,  while  desirable, 
is  not  as  critical. 

Recall  it  was  conjectured  in  the  previous  section  that  certain 
difficulties  faced  by  design  methods  employing  closed  loop  transfer  functions 
are  either  avoided  or  are  encountered  in  a  more  tractable  form  by  methods 
employing  the  open  loop  transfer  function.  The  remainder  of  this  section  is 
devoted  to  discussing  this  conjecture  in  the  case  where  the  open  loop  transfer 
function  must  satisfy  various  bandwidth  constraints.  By  bandwidth  constraints 
is  meant  both  frequency-dependent  bounds  on  | L ( j oo ) |  as  well  as  requirements 
on  the  rate  at  which  ( L  ( j  <jj)  |  0  as  (If  L(s)  is  a  rational  transfer 

function  then  the  latter  requirement  can  be  rephrased  in  terms  of  the  order 
of  the  zero  of  L(s)  at  infinity.) 

First  discussed  is  an  argument  of  Horowitz  [9,  Section  6.13] 
concerning  the  feasibility  of  design  by  directly  choosing  the  sensitivity 
function  to  satisfy  design  goals  such  as  (4.1)  while  insuring  the  resulting 
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feedback  compensator  is  proper.  Now,  as  has  recently  been  pointed  out  [16], 
any  sensitivity  function  obtainable  with  an  improper  feedback  compensator  may 
be  approximated  as  closely  as  desired  by  a  sequence  of  strictly  proper 
compensators.  Thus  objections  based  solely  upon  the  inability  to  guarantee 
that  a  strictly  proper  compensator  is  obtained  can  readily  be  overcome. 
Realistic  design  constraints,  however,  impose  much  more  severe  limitations  on 
the  class  of  admissible  compensators  than  mere  propriety.  The  implications 
of  these  constraints  for  the  method  of  design  by  direct  choice  of  sensitivity 
function  shall  be  discussed  in  detail  after  Horowitz'  argument  is  reviewed. 

Suppose  that  one  directly  chooses  a  function  S(s)  which  is  stable 
and  satisfies  the  design  constraint  (4.1).  Assume  for  simplicity  that  the 
plant  is  a  stable,  minimum  phase  rational  function.  Then  a  (possibly  improper) 
feedback  compensator  yielding  S(s)  as  the  sensitivity  function  of  the 
resulting  design  may  be  obtained  from 

F(s)  -  P"1(s) [1  -  S_1(s) ]  .  (4.2) 

Now,  in  order  to  be  realizable,  this  compensator  must  be  at  least  proper. 

This  requirement  in  turn  may  be  translated  into  an  algebraic  constraint  on 
the  sensitivity  function,  as  is  now  described.  Again,  this  discussion  is 
taken  from  Horowitz  [9,  Section  6.13]. 

Following  Horowitz,  let  the  plant  have  n  poles  and  m  zeros.  Since 
physical  plants  are  strictly  proper,  it  follows  that  the  pole-zero  excess 
e  -  n-m  must  be  greater  than  zero.  By  straightforward  manipulation, 

Horowitz  shows  that  the  sensitivity  function  must  have  the  form 
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S(s) 


sn  +  I  a  Si_1 
i-1 

n  “  .  i-1 

s  +  E  b.s 

i-1 


(4.3) 


where 


3^  —  ,  1—1^... ,m-l  . 

This  fact,  in  turn,  implies  that  the  function  1-S  ^(s)  must  have  a  zero  of 
order  (at  least)  e  at  s®“.  The  reason  for  this  constraint  may  be  seen 
from  (4.2);  it  is  necessary  in  order  that  the  compensator  F(s)  be  proper. 

Horowitz  then  proceeds  to  point  out  the  difficulty  involved  in 
choosing  a  function  S(s)  which  satisfies  both  the  design  goals  (4.1)  as 
well  as  the  constraints  (4.3).  Indeed,  by  fiddling  with  examples,  one 
readily  concludes  that  the  problem  is  not  trivial,  and  in  fact  seems  nearly 
intractable  for  all  but  very  small  values  of  e. 

To  summarize,  the  requirement  that  a  proper  compensator  be  obtained 
translates  into  an  algebraic  constraint  on  the  class  of  achievable  sensitivity 
functions.  The  necessity  of  satisfying  this  constraint  implies  that  attempts 
to  design  by  directly  choosing  the  sensitivity  function  will  meet  with 
difficulty.  The  problem  is  easily  avoided,  however,  in  open  loop  based 
design.  Since  the  compensator  F(s)  is  the  design  variable,  one  can  trivially 
insure  that  F(s)  is  proper  merely  by  choosing  it  to  be  so.  Of  course,  one 
still  has  to  verify  that  the  resulting  closed  loop  system  is  stable  and  that 
jSCju.) |  satisfies  design  goals.  These  may  themselves  not  be  trivial  problems; 
however,  classical  Nyquist/Bode  techniques  have  proven  to  be  effective  in 


establishing  the  relationship  between  L(joj)  and  S(jw).  Moreover,  if  a  given 
design  is  judged  unsatisfactory,  classical  techniques  can  provide  insight 
into  how  the  compensator  might  be  modified  to  alter  sensitivity  properties. 

Now,  in  [9],  Horowitz  ended  his  discussion  after  concluding  that 
choosing  S(s)  to  satisfy  both  (4.1)  and  (4.3)  is  at  best  difficult.  It  is 
possible,  however,  to  continue  the  analysis  to  show  that,  for  e  >  2,  the 
goals  (4.1)  and  constraint  (4.3)  may  actually  be  mutually  incompatible,  rhus 
not  only  is  the  problem  difficult,  it  may  well  be  theoretically  impossible. 
This  can  be  showr.  by  using  the  analyticity  of  S(s)  to  translate  the  algebraic 
constraint  on  S(°°)  into  a  constraint  on  |S(ju>)|,  Vcu.  For  e  >  2,  the  Bode 
integral  relation  (Theorems  2.1  and  3.3)  implies  that 

„  N 

00  p 

/  log | S (joj)  | doi  *  S  Re[p.]  (4.4) 

0  i-1 


where  (p^  ;  i=*l,...,Np}  is  the  set  of  unstable  poles  o-f  L(s)  .  (Although  not 
stated  in  the  literature  or  in  Chapter  3,  there  also  exist  integral  constraints 
on  the  first  e-2  derivatives  of  the  sensitivity  function.) 

At  this  point  it  should  be  noted  that  the  problem  as  posed  thus  far 
is  not  completely  realistic.  In  practice,  the  class  of  admissible  feedback 
compensators  is  constrained  by  requirements  much  more  severe  than  mere 
propriety.  In  particular,  not  only  is  the  order  of  the  zero  of  L(s)  at 
infinity  constrained,  but  also  limitations  on  the  bandwidth  of  L(s)  are 
present.  These  latter  limitations  in  turn  impose  constraints  on  the  band¬ 
width  of  the  feedback  compensator  F(s).  (Also,  note  the  approximation  schemes 
of  [16]  are  not  applicable  when  the  bandwidth  of  the  compensator  is 
constrained.)  Two  practical  reasons  why  the  bandwidth  of  the  open  loop 
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transfer  function  must  be  constrained  are  the  natural  bandwidth  limitations 
of  the  plant  and  the  necessity  of  rolling  off  the  open  loop  gain  to  provide 
stability  robustness  (for  example,  to  avoid  exciting  unmodelled  high  frequency 
dynamics).  Thus  in  realistic  situations  S(s)  must  be  chosen  so  that  the 
resulting  open  loop  transfer  function  satisfies  a  bandwidth  constraint  such 
as  that  discussed  in  Section  3.3, 

|L(jw)|  <  -£k  -  £  *  (4‘5) 

US 

for  u>  >  us  ,  e<l  and  k>0.  As  the  Corollary  in  Section  3.3  shows,  the  tail 
of  the  integral  in  (4.4)  must  satisfy 

°°  log  [T"^ — ]us 

/  log|s(joj)  Ida.  < - -^-£  -C  .  (4.6) 

wc 

Thus,  requiring  that  L(s)  satisfy  constraints  such  as  (4.5)  imposes  even  more 
severe  limitations  upon  the  class  of  achievable  sensitivity  functions.  Hence, 
closed  loop  based  design  techniques  are  faced  with  the  difficulty  that 
candidate  sensitivity  functions  must  yield  a  compensator  lying  in  a  rather 
severely  constrained  class.  Now, it  is  true  that  the  class  of  function  S(s) 
yielding  such  compensators  may  be  characterized  directly  in  terms  of  |S(jus)j 
using  (4.4)  and  (4.6).  However,  the  problem  is  still  nontrivial  since  it  is 
difficult  to  look  at  a  plot  of  log  |S  (jus)!  and  determine  whether  the  necessary 
integral  constraints  are  satisfied.  The  problem  is  particularly  insidious 
since  the  value  of  |S(jui)|  is  directly  related  to  system  performance.  Since 
small  values  of  |s(jus)|  indicate  desirable  sensitivity  properties,  the  tendency 
would  be  to  attempt  a  design  for  which  the  integral  of  log j S ( j oo)  |  in  (4.4) 
is  negative.  Since  increasing  {S(jus)j  at  some  frequency  results  in  poorer 
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performance  at  that  frequency,  one  would  like  some  insight  into  how  this 

« 

might  be  accomplished  so  that  design  quality  is  sacrificed  no  more  than  is 
absolutely  necessary.  Again,  open  loop  based  design  techniques  approach 
this  problem  by  limiting  the  choice  of  candidate  compensators  to  those  which 
are  admissible.  Moreover,  the  Bode  gain-phase  relations  and  Nyquist  plots 
may  be  used  to  provide  insight  into  the  fact  that  the  rate  at  which  loop  gain 
rolls  off  affects  the  value  of  the  sensitivity  function.  To  summarize,  the 
difficulty  in  satisfying  constraints  on  admissible  sensitivity  functions 
is  readily  dispensed  with  and  insight  to  aid  in  designing  a  compensator 
yielding  satisfactory  closed  loop  properties  is  available. 


4.3.  Design  Limitations  Due  to  Nonminimum  Phase  Plants 

The  purpose  of  this  section  is  to  discuss  design  limitations  which 
are  present  as  a  consequence  of  the  plant  (more  generally,  the  open  loop 
transfer  function)  being  nonminimum  phase  due  to  the  presence  of  zeros  in  the 
open  right  half  plane.  These  limitations  restrict  the  class  of  achievable 
sensitivity  specifications  and  thus  pose  obstacles  to  any  design  scheme 
whose  goal  is  satisfaction  of  such  specifications.  As  in  the  previous 
section,  a  comparison  will  be  made  between  design  techniques  based  upon  the 
open  loop  transfer  function,  i.e.,  those  which  use  the  feedback  compensator 
as  the  design  variable,  and  a  hypothetical  design  procedure  which  uses  a 
candidate  sensitivity  function  as  the  design  variable.  The  latter  technique 
must,  of  course,  insure  that  the  resulting  feedback  compensator  yields  no 
unstable  pole/zero  cancellations  in  the  open  loop  transfer  function.  The 


two  design  schemes  compared  may  be  referred  to  as  "design  by  direct  choice 
of  the  compensator"  and  "design  by  direct  choice  of  the  sensitivity  function, 
respectively. 

When  a  feedback  system  is  designed  around  a  nonminimum  phase  plant, 
internal  stability  requires  that  the  feedback  compensator  possess  no  unstable 
poles  at  the  right  half  plane  zeros  of  the  plant.  This  constraint  on  the 
class  of  admissible  compensators  implies  that,  to  be  achievable,  the 
sensitivity  function  must  satisfy  the  following  algebraic  constraints  at 
each  right  half  plane  plant  zero  z  of  multiplicity  m: 

S(z)  -  1 
di 

^-jSl  *0  i«l,...,m-l  .  (4.7) 

ds  's*z 

If  a  proposed  sensitivity  function  violates  (4.7),  then  the  feedback  compen¬ 
sator  obtained  from  (4.2)  will  have  one  or  more  unstable  poles  at  right 
half  plane  plant  zeros.  Experience  with  some  examples  should  convince  the 
reader  that  choosing  a  function  S(s)  satisfying  one  or  more  constraints 
(4.7)  while  also  achieving  design  goals  such  as  (4.1)  may  be  quite  difficult. 
(In  fact,  as  shown  in  Chapter  3,  it  may  well  be  impossible.  This  will  be 
reviewed  shortly.)  To  summarize ,  if  the  constraints  (4.7)  are  not  satisfied, 
then  the  compensator  F(s)  obtained  from  (4.2)  will  be  inadmissible  due  to 
unstable  pole-zero  cancellations  in  L(s).  On  the  other  hand,  if  these 
constraints  are  satisfied,  but  the  design  goals  (4.1)  are  not  achieved,  then 
it  isn't  clear  that  insight  is  provided  into  how  S(s)  might  be  modified  to 
better  achieve  design  goals  while  maintaining  internal  stability. 


MM 


Classical  design  techniques  based  upon  directly  choosing  the 


feedback  compensator  can,  on  the  other  hand,  avoid  the  difficulty  of 


satisfying  the  constraints  (4.7)  simply  by  selecting  only  candidate  feed¬ 


back  compensators  which  have  no  poles  at  right  half  plane  plant  zeros. 


Insight  into  the  relation  between  the  feedback  compensator  and  the  resulting 


sensitivity  function  is  available  from  Nyquist  and  Bode  plots.  In  particular, 


the  difficulty  caused  by  the  extra  phase  lag  in  L(jco)  contributed  by  the 


right  half  plane  zeros  is  readily  seen. 


Although  insight  into  the  qualitative  effects  of  right  half  plane 


plant  zeros  has  long  been  available,  only  recently  have  precise  quantitative 


statements  of  their  impact  upon  feedback  design  been  obtained.  As  shown  in 


Chapter  3,  the  algebraic  constraints  (4.7)  have  equivalent  expressions  in 


terms  of  integral  relations  constraining  the  behavior  of  log  S(j<u).  These 


relations  are  significant  in  that  they  express  limitations  due  to  right  half 


plane  zeros  in  terns  of  the  sensitivity  function  evaluated  along  the  ju-axis, 


where  design  specifications  are  imposed. 


For  example,  the  constraint  S(z)  *  1  is  equivalent  to  (see  Theorem 


flog  I B  ^(z)  |  -  /  log|S(jw)  |de  (u>) 


(4.8) 


where  B^(s)  is  the  Blaschke  product  of  unstable  plant  and  compensator  poles 


2  ,  ,  ,2 

x  +  (y-oj) 


x  +  jv 


(4.9) 


■  v  r :  v.-  -v 


(If  z  is  complex,  then  there  exists  an  additional  constraint  on  *S(jco);  if 


z  has  multiplicity  greater  than  one,  then  there  exist  constraints  on 
d1 

— r  log  S)  .  Before  proceeding,  some  relevant  observations  concerning  the 
ds 

integral  constraint  (4.8)  will  be  reviewed. 

First,  since  (B^^z)  |  >  1,  and  since  (4.9)  is  positive  Vu>,  it 
follows  that,  if  |S(jto)|  <1  over  some  frequency  range,  then  necessarily  there 
exist  other  frequencies  for  which  |S(jui)  j  >  1.  Thus  a  tradeoff  exists  among 
sensitivity  properties  in  different  frequency  ranges.  Moreover,  since  the 
weighted  length  of  the  joi-axis  is  finite  (and  equal  to  it),  this  tradeoff 
must  be  accomplished  primarily  over  a  finite  frequency  range. 

Next,  consider  the  implications  of  this  tradeoff  upon  the  ability 
of  a  feedback  system  to  satisfy  both  (4.1)  and  (4.7).  As  shown  in  Chapter  3, 
one  may  derive  a  lower  bound  on  fsll  =  sup  |S(ju>)|,  given  that  (4.1a)  is 
satisfied.  Using  this  lower  bound,  one  may  then  analyze  whether  it  is  also 
possible  to  satisfy  (4.1b).  Thus  one  can  actually  prove  that  certain  design 
specifications  are  unachievable  given  the  constraints  imposed  by  the  right 
half  plane  plant  zeros. 

The  question  now  arises  as  to  whether  the  integral  relations  derived 
in  Section  3.2  may  provide  sufficient  insight  to  allow  one  to  design  by 


directly  choosing  the  sensitivity  function.  Certainly  some  insight  may  be 


obtained  from  the  function  6^(0)),  which  quantities  the  notion  of  proximity 
of  a  point  on  the  jw-axis  to  a  right  half  plane  zero.  The  presence  of  this 


term  in  (4.8)  shows  that  the  price  paid,  in  terms  of  sensitivity  increase. 


for  obtaining  a  given  level  of  sensitivity  reduction  over  a  given  frequency 


interval  is  a  function  of  the  location  of  that  interval  relative  to  the  zero. 
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For  example,  at  sufficiently  high  frequencies  the  weighting  becomes  very 
small.  Hence  the  value  of  |S(jw)|  must  be  either  very  large  or  very  small 
at  those  frequencies  in  order  to  contribute  to  the  value  of  the  integral 
(4.8).  On  the  other  hand,  requiring  sensitivity  to  be  very  small  over  a 
frequency  range  containing  y  =  Im[z]  can  lead  to  large  values  of  sensitivity 
elsewhere.  This  effect  is  exacerbated  if  | S ( j to) |  is  only  allowed  to  exceed 
unity  over  frequency  intervals  whose  weighted  length  is  small. 

Now  consider  the  problem  of  design  by  directly  choosing  the 
sensitivity  function  when  the  plant  has  a  single  (real)  right  half  plane 


zero  x.  For  simplicity,  assume  that  the  plant  is  stable.  Let  S^(s)  be  a 


candidate  sensitivity  function.  From  the  Lemma  in  Appendix  A  it  follows 
that 


Trlog|si(x)  I  -  /  log|S1(ju)  |dex(u)) 


This  equation  is  significant  since  S^(s)  has  an  internally  stable  realization 
only  if  S^(x)  ■  1.  Now,  suppose  this  condition  is  not  met,  and  assume  further 
that  |S^(x)|<l.  (Since  good  sensitivity  is  achieved  by  requiring  that  |s^(juj)| 
is  sufficiently  less  than  one,  this  is  surely  the  most  likely  case,  at 
least  in  an  initial  attempt.)  Now,  if  indeed  !s^(x)[<  1,  it  will  be  necessary 
to  choose  a  new  sensitivity  function  S^Cs)  with  larger  values  of  ^(jw)!  over 


some  frequency  range.  The  weighting  function  yields  at  least  some 


insight  into  how  this  may  be  done.  For  example,  choosing  a  new  sensitivity 
function  whose  magnitude  is  larger  only  at  lightly  weighted  frequencies  will 
not  generally  be  effective;  one  must  increase  the  magnitude  at  frequencies 


for  which  the  weighting  i s  significant.  If  it  turns  out  that 


significantly  large  values  of  the  weighting  lie  only  within  the  frequency 
range  over  which  sensitivity  is  desired  to  be  small,  then  nontrivial  com¬ 
promises  in  system  performance  will  have  to  be  made.  Whether  enough  insight 
is  generally  available  from  the  weighting  function  6^(0))  to  enable  such 
compromises  to  be  made  satisfactorily  remains  to  be  seen. 

In  any  event,  as  will  now  be  shown,  the  fact  that  P(s)  was  assumed 
to  have  only  one  zero  in  the  right  half  plane  facilitates  design  of  S(s). 

This  follows  since  each  candidate  sensitivity  function  may  easily  be  modified 
to  produce  a  new  candidate  function  yielding  an  internally  stable  design. 

All  that  is  necessary  to  perform  the  modification  is  to  define  the  new 

(jw) |  and  |S.(j«) |  may  have  very 
different  values.  Suppose,  however,  that  a  function  S^s)  is  obtained  which 
satisfies  design  goals  sufficiently  well  and  for  which  Si(x)Ssl.  Then  S^s) 
defined  in  this  fashion  will  both  yield  an  internally  stable  design  and, 

.  |s1U“>l 

since  |S.  (jw)|  =  — .  ■  .  ,will  likely  come  close  to  satisfying  the  design 

goals.  Of  course,  this  strategy  works  because  the  plant  was  assumed  to  have 
but  a  single  right  half  plane  zero.  If  more  than  one  zero  (even  a  complex 
conjugate  pair  of  zeros)  is  present,  then  simple  scaling  will  not  generally 
yield  a  function  satisfying  all  the  necessary  constraints. 

Finally,  consider  once  again  the  approach  to  this  problem  involving 
open  loop  shaping;  i.e.,  "design  by  direct  choice  of  the  compensator."  From 
a  Nyquist  diagram  one  can  see  that  the  frequency  response  of  the  open  loop 
transfer  function  must  eventually  penetrate  the  unit  circle  centered  at  the 
critical  point.  Thus  there  will  necessarily  exist  a  frequency  range  for 
which  |S(joj)[  >1;  in  particular,  there  will  exist  a  peak  in  the  sensitivity 


A  Si(s) 

function  S^s)  =  s~(x)  ‘  course»  I  ^ 


function  near  the  point  on  the  Nyquist  plot  of  L(jto)  closest  to  the  critical 

■S 

point.  The  size  of  this  peak  may  be  varied  via  lead  or  lag  compensation,  but 
its  existence  will  be  unavoidable.  Moreover,  it  can  also  be  seen  that,  if  -V 

the  size  of  the  peak  is  not  to  be  excessively  large,  then  the  open  loop  gain 
must  be  rolled  off  before  the  phase  lag  produced  by  the  zero  becomes  too 
great.  Again,  whether  such  insights  are  as  readily  available  in  the  design  \I 

procedure  involving  direct  choice  of  the  sensitivity  function  remains  an 

rt 

open  question. 


4.4.  Sensitivity  Design  via  H°°-Optimization 

The  purpose  of  this  section  is  to  discuss  some  properties  of  a 
recently  proposed  approach  to  the  problem  of  feedback  design.  This  approach, 
which  utilizes  H°°-optimization  methods,  employs  closed  loop  transfer  functions 
directly  in  the  design  process.  The  specific  design  problem  discussed  in  this 
section  is,  as  in  the  rest  of  this  chapter,  the  achievement  of  good  sensitivity 
properties.  The  solution  to  the  relevant  H°°-optimization  problem  is  discussed 
at  length  by  Zames  and  Francis  [16].  It  should  be  emphasized  that  the  design 
goals  in  [16]  are  not  the  satisfaction  of  frequency-dependent  norm  bounds 
such  as  (4.1).  Other  authors,  however,  have  considered  satisfaction  of  norm 
bounds  as  the  goal  of  H°°-optimization .  The  results  of  [16]  are  used  in  the 
present  analysis  for  availability  and  for  convenience. 

As  discussed  earlier  in  this  chapter,  feedback  design  methods  which 
directly  employ  closed  loop  transfer  functions  may  be  faced  with  difficulties 
which  are  either  avoided  or  rendered  more  tractable  by  classical  methods 


m 
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employing  the  open  loop  transfer  function.  Again,  it  is  conjectured  that 
there  may  be  a  real  tradeoff  between  design  techniques  dealing  directly  with 
closed  loop  transfer  functions,  for  which  limitations  imposed  by  properties 
of  the  open  loop  plant  may  only  be  dealt  with  indirectly,  and  design 
techniques  dealing  directly  with  open  loop  transfer  functions,  for  which 
the  feedback  properties  of  a  design  may  only  be  assessed  indirectly.  Thus, 
once  the  H00  approach  has  been  described,  it  will  be  examined  to  see  whether 
difficulties  in  dealing  with  open  loop  plant  limitations  are  present  in  this 
closed  loop  based  design  technique  also . 

The  H00  sensitivity  minimization  problem  is  to  find,  among  all 
stabilizing  feedback  compensators,  one  for  which  a  weighted  infinity  norm 
of  the  sensitivity  function  is  minimized: 

min  fWSj^  »  min  sup  |w(ju;)S(juj)  |  .  (4.10) 

F  F  oo 

The  stable,  minimum  phase,  weighting  function  W(s)  is  to  be  supplied,  via  some 

* 

procedure,  by  the  designer.  For  nonminimum  phase  plants,  internal  stability 
requires  that  the  constraints  (4.7)  be  satisfied  at  each  open  right  half 
plane  plant  zero.  Thus  the  optimization  problem  (4.10)  is  constrained  by 
(4.7).  However,  the  limitations  on  feedback  design  imposed  by  (4.7),  which 
were  interpreted  in  Section  4 . 3  as  tradeoffs  among  levels  of  sensitivity  along 
the  jw-axis,  are  treated  in  this  framework  merely  as  algebraic  constraints 

The  reader  is  cautioned  that  the  term  weighting  function  has  two  uses  in  this 
chapter.  These  should  be  distinguishable  by  context.  One  use  is  to  refer  to 
the  functions  0z(uj)  and  0_ (uj)  which  appear  in  the  integral  constraints  due  to 
right  half  plane  zeros  ana  poles  (Theorems  3.1  and  3.2).  The  other  use  is  to 
refer  to  the  function  W(s)  used  as  a  design  variable  in  the  H°°-optimization 
process . 
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imposed  upon  the  sensitivity  function  at  isolated  points  in  the  open  right 
half  plane.  The  significance  of  this  distinction  will  become  apparent  below. 

The  solution  to  the  problem  (4.10)  given  in  [16]  shows  that,  given 
the  existence  of  a  nonminimum  phase  plant,  the  optimal  unweighted  sensitivity 
function  has  the  form 


S*Cs) 


Bp(s)B] 


.*(s)W-1(s) 


(4.11) 


where  Bp(s)  is  the  Blaschke  product  of  unstable  plant  poles.  The  Blaschke 
product  Bp*(s)  contains  unstable  poles  of  the  optimal  feedback  compensator. 
There  are  at  most  one  less  of  these  poles  than  the  number  of  open  right  half 


plane  plant  zeros.  The  unstable  pole  locations,  as  well  as  the  value  of  the 
constant  k*,  are  determined  by  the  constraints  (4.7). 

Since  Blaschke  products  are  allpass  of  unit  magnitude,  it  follows 
from  (4.11)  that 

I S*( ju) |  =  \k*\ •iW_1(jw) |  .  (4.12) 

Thus  the  weighting  function  determines  the  shape  of  the  sensitivity  function; 
i.e.,  its  relative  magnitude  at  different  frequencies.  The  level  of  sensitiv¬ 
ity  reduction,  however,  is  determined  by  the  constraints  (4.7)  at  the  right 
half  plane  plant  zeros. 

One  possible  way  to  construct  the  weighting  function  is  to  choose 

W(s)  =*  M_1(s)  (4.13) 

where  M(s)  is  a  stable  minimum  phase  transfer  function  with  | M ( j oj )  j  =  M(oj)  , 

Vw,  and  M(w)  is  given  by  the  specification  (4.1).  Using  this  procedure, 
the  H°°-optimal  solution  satisfies  the  specification  (4.1)  if 
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fWS*^  =  |W(ju)S*(ju)|  <  1  .  (4.14) 

Moreover,  if  JWS  jj ^  >  1,  then  the  specification  (4.1)  is  unachievable.  Since 
the  optimal  weighted  sensitivity  function  W(s)S  (s)  is  allpass  (4.10)-(4.11)  , 
it  follows  in  turn  that  if  (4.1)  is  violated,  then  it  is  violated  equally 
at  all  frequencies. 

Now,  one  advantage  of  the  H"-optimization  procedure  is  that  it 

guarantees  the  constraints  (4.7)  will  be  satisfied.  The  design  goals  (4.1), 

on  the  other  hand,  may  well  be  violated,  whether  the  weighting  function  is 

chosen  to  directly  reflect  these  goals,  as  in  (4.13),  or  not.  This  follows 

from  the  fact  that,  although  the  designer  can  specify  the  shape  of  the 

optimal  unweighted  sensitivity  function,  the  level,  or  magnitude,  is  determined 
* 

by  the  constant  k  which  is  determined  by  the  locations  of  the  right  half 
plane  plant  zeros. 

Suppose,  then,  that  a  weighting  function  has  been  chosen  and  the 
resulting  optimal  unweighted  sensitivity  function  violates  the  design  goals . 

In  addition,  suppose  that  it  is  desired  to  choose  a  new  weighting  function 
(perhaps  by  suitable  modification  of  the  existing  weighting)  so  that  the 
new  optimal  sensitivity  satisfies,  for  example,  the  hard  design  specification 
(4.1a).  Of  course,  this  would  entail  violating  the  soft  design  specification 
(4.1b).  In  view  of  the  presumed  relative  importance  of  the  hard  and  soft 
specifications,  this  is  not  unreasonable  provided  the  v'olation  of  (4.1b) 
is  not  excessive.  The  question  which  naturally  arises  is:  At  which  fre¬ 
quencies,  and  by  how  much,  should  the  weighting  be  modified  in  order  to 
accomplish  this  task? 
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One  might  hope  that  some  insight  into  this  question  would  be 
furnished  by  the  manner  in  which  the  current  design  violates  the  design 
specifications.  Certainly  classical  design  techniques  provide  some  such 
insight.  For  example,  looking  at  Nyquist  and  Bode  plots  not  only  reveals 
when  a  given  design  has  poor  sensitivity,  but  also  gives  information 
concerning  how  to  use  lead  and/or  lag  compensation  to  modify  the  existing 
design.  In  addition,  these  techniques  provide  some  insight  into  how  the 
additional  phase  lag  contributed  by  right  half  plane  zeros  complicates  the 
design  process.  In  particular,  qualitative  insight  into  how  the  location  of 
the  zero  limits  the  frequency  range  over  which  large  open  loop  gain  may  be 
achieved  is  provided. 

Note,  however,  that  H“  designs  violate  (4.1)  even  at  those  fre¬ 
quencies  which  are  far  away  from  any  right  half  plane  zeros  and  for  which  the 
value  of  |S(joj)|  contributes  little  to  the  tradeoff  imposed  by  the  integral 
constraint  (4.8).  While  the  fact  that  the  specification  is  violated 
equally  at  all  frequencies  isn't  necessarily  bad  by  itself,  it  doesn't  seem 
to  furnish  much  insight  into  how  difficult  various  portions  of  the  specifica¬ 
tion  are  to  satisfy.  Consequently,  it  would  seem  that  insight  into  how  the 
weighting  function  might  be  modified  must  be  obtained  from  other  sources. 

This  issue  will  be  explored  in  the  next  section. 

4.5.  Relations  Between  W(s)  and  js*(1iu)| 

In  this  section  the  relation  between  the  weighting  function  and  the 
optimal  unweighted  sensitivity  function  is  investigated.  The  purpose  is  to 
examine  issues  which  must  be  considered  in  attempting  to  modify  the  weighting 


W(s)  to  obtain  a  sensitivity  function  satisfying  frequency-dependent  magnitude 
bounds,  such  as  (4.1),  over  at  least  some  frequencies. 

First  from  (4.7)  and  (4.11)  it  may  be  seen  that  the  following 
conditions  must  be  satisfied  at  each  right  half  plane  plant  zero  z  of 
multiplicity  m: 


1  -  k*Bp(z)BF*(z)w"1(z) 


0  "  VX1 

,  i  P  F 
ds  [ s=z 


(4.15) 


1=1, ... ,m-l 


From  (4.15)  it  follows  that  if  the  plant  possesses  only  one  (real)  right  half 
plane  zero  (so  that  BF*(s)  =  1)  then 


|k*|  =  iW(z)B~1(z) 


(4.16) 


If  more  than  one  zero  is  present,  the  value  of  k  increases  as  a  function 


of  the  proximity  of  thi  unstable  compensator  poles  to  the  zeros: 

V  p  +  z 

!BF*(3S)  I  *  I  n  >  1  • 

*  i=l  pi  Z 


Suppose,  however,  that  the  conditions 


W (Zj)  *  W(zk) 


Vj,k 

and,  at  each  zero  of  multiplicity  m>  1, 

-1  ’  dV1 


(4.17a) 


ds 


i  BP 


w"1(2)  +  Bp(z) 


ds 


s=z 


i  1 , . . . ,m-l 


(4.17b) 


s*z 


are  satisfied  (which  may  not  be  possible  for  open  loop  unstable  nonminimum 


phase  plants).  Then  (4.15)  can  be  satisfied  with  Bp*(s)  *  1.  Hence  the 
optimal  compensator  is  stable  and  |k*|  again  satisfies  (4.16). 

In  any  event,  at  each  open  right  half  plane  zero  the  optimal 
unweighted  sensitivity  function  must  satisfy 

I  S*(jto)  |  >  |W(2)j.|B“1(z)|-|w‘1(jo))|  (4.18) 

with  equality,  holding  if  and  only  if  (4.16)  holds. 

Motivated  by  the  discussion  of  the  previous  section,  suppose  one 
wishes  to  modify  an  existing  weighting  function  so  that  |S  (jw)  |  obtained 
from  the  new  weighting  satisfies  a  magnitude  bound  in  some  frequency  range 
without  becoming  excessively  large  at  other  frequencies.  From  (4.11)  and 

*dp 

(4.15)  it  follows  that  simply  scaling  W(s)  by  a  constant  leaves  S  (s) 

i  *  i 

unchanged.  Thus  in  order  to  affect  jS  (jco)  |  it  is  necessary  to  change  the 
relative  magnitude  of  |w(joj)|  at  different  frequencies.  This  will  now  be 
discussed. 

Assume  first  that  P(s)  has  only  one  right  half  plane  zero,  so  that 
(4.18)  is  an  equality.  Then  (4.18)  shows  that  |S  (joj)  |  is  a  function  of  both 
the  magnitude  of  the  weighting  at  the  zero  and  the  weighting  along  the  ju-axis 

1*1 

If  these  quantities  were  independent,  then  one  could  reduce  jS  (jw) |  at  some 
frequency  simply  by  increasing  the  magnitude  of  the  weighting  at  that 
frequency  and/or  by  reducing  the  magnitude  at  the  .zero.  Unfortunately, 
however,  the  magnitude  of  the  weighting  at  the  zero  is  not  independent  of 
its  value  along  the  jw-axis.  In  fact,  |W(jw) |  is  completely  determined  by 
{|W(ju))|,  Voj}.  This  follows  since,  at  each  open  right  half  plane  zero,  z,  of 
multiplicity  m,  the  weighting  satisfies  the  integral  relations  (see 


Appendix  A) : 


trlog  |  W(z) 

1  -  /  log  |  W(joj)  |  d6  (oj) 
z 

—  00 

(4.19a) 

7T*  W(z)  ■ 

00 

/  *W(ju>)d6  (u>) 
-00  2 

(4.19b) 
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tt  ^-r  logW 

3  j  - _  logW 

d0  (w)  i“l,...,m-l 

(4.19c) 

ds 

s-z  ”  ds 

Ct 

S*j(D 

de 

where  ——  is  given  by  (4.9). 

dot 

From  (4.19a)  it  follows  that  increasing  the  magnitude  of  the 
weighting  along  the  jw-axis  can  only  increase  the  magnitude  of  the  weighting 
at  the  zero.  Similarly,  |W(z)  |  can  be  decreased  only  by  decreasing  |W(ju>)  | 
over  some  frequency  range.  Hence,  from  the  above  discussion  of  (4.18)  and 
its  implications,  it  follows  that  attempts  to  modify  the  weighting  to  reduce 
|S  (jui)  |  will  be  complicated  by  these  two  opposing  tendencies. 

Thus,  in  order  to  modify  |W(ju>)  |  to  affect  |S  ( j to)  |  in  some  desired 
fashion,  it  appears  necessary  to  have  a  good  understanding  of  the  relation 
between  |W(Juj)|  and  |w(z)|.  This  relation  is  not  completely  transparent; 
however,  some  insight  may  be  gained  from  (4.19a).  For  example,  at  high 

de 

frequencies  for  which  is  small,  changing  |W(jw) |  will  have  relatively 

little  effect  upon  the  value  of  |w(z) ] .  Thus, attempting  to  meet  a  low 

1*1 

frequency  specification  on  |S  (ju) |  by  decreasing  the  weighting  in  a  high 

d9 

frequency  range  will  be  effective  only  if  -jj-  is  relatively  large  in  this 
range.  Otherwise  the  effect  will  be  to  increase  sensitivity  at  high 
frequencies  with  very  little  attendant  decrease  at  low  frequencies. 

If  the  plant  has  more  than  one  right  half  plane  zero,  then  in 


general  the  bound  (4.18)  will  not  be  tight  due  to  the  presence  of  unstable 


compensator  poles.  Nevertheless,  if  this  lower  bound  is  too  large  for  any 
zero,  the  above  comments  apply  to  attempts  to  reduce  the  bound.  Many  plants 
with  more  than  one  right  half  plane  zero  (including  all  stable  plants)  can 
be  used  in  a  stable  feedback  system  with  a  stable  compensator.  For  these 
plants  an  alternate  procedure  is  to  require  the  weighting  to  satisfy  (4.18). 
This  requirement  may  also  be  stated  in  terms  of  W(ju)  using  (4.19). 

In  Section  4.4  it  was  pointed  out  that  design  limitations  due  to 
nonminimum  phase  plants  are  treated  in  the  H"  framework  as  constraints 
imposed  at  points  in  the  right  half  plane,  rather  than  along  the  jw-axis. 
From  the  preceding  discussion,  however,  it  appears  that  interpreting  the 
constraints  as  tradeoffs  imposed  along  the  ju-axis  is  necessary  if  the 
weighting  function  is  to  be  shaped  to  satisfactorily  affect  the  sensitivity 
function.  Moreover,  it  has  been  argued  that  it  is  necessary  to  consider 
the  relation  between  |w(jui)  |  and  |W(z)  |  .  Qualitative  insights  into  this 
relation  are  available;  however,  whether  these  insights  are  sufficient  to 
reliably  produce  a  satisfactory  design  seems  to  be  an  open  question. 


4.6.  Comparison  between  Design  by  Directly  Choosing  the  Sensitivity  Function 
and  Design  via  ^-Optimization 

Two  procedures  have  now  been  discussed  for  designing  a  system  with 
satisfactory  sensitivity  properties  when  the  plant  is  nonminimum  phase. 

Both  these  procedures  employed  a  closed  loop  transfer  function,  specifically, 
the  sensitivity  function,  in  performing  the  design.  The  first,  discussed  in 
Section  4.3,  consisted  of  iterating  a  candidate  sensitivity  function  S^s), 


until  one  which  satisfied  the  constraints  (4.7)  was  obtained.  This  technique 


had  the  advantage  that  one  could  choose  the  level  of  sensitivity  reduction 
js^Cjo))]  and  the  disadvantage  that  an  internally  stable  system  is  generally 
not  obtained. 

The  second  design  scheme,  discussed  in  Sections  4.4  and  4.5,  was 
the  H°°-optimization  method.  This  technique  involved  iterating  a  weighting 
function  W^(s)  in  order  to  obtain  a  satisfactory  sensitivity  function, 
and  had  the  advantage  that  an  internally  stable  design  is  always 
achieved  (albeit  with  generally  improper  and  possible  unstable  compensators) . 
The  disadvantage  of  this  technique  is  that  the  level  of  sensitivity  reduction 
along  the  jw-axis  is  determined  a  posteriori  via  the  optimization  process, 
and  consequently  cannot  be  controlled  directly. 

The  perceptive  reader  will  have  noticed,  however,  that  the 
difficulty  involved  in  both  techniques  arises  from  the  same  source;  namely, 
the  fact  that  the  value  of  a  stable  minimum  phase  rational  function  at  a 
point  in  the  right  half  plane  is  completely  determined  by  its  values  along 
the  ju-axis.  Consequently,  these  two  quantities  cannot  be  manipulated 
independently  in  design.  Thus,  although  the  two  design  procedures  yield 
intermediate  results  with  different  properties,  the  ultimate  problem  which 
must  be  solved  by  each  is  the  same.  This  may  be  shown  explicitly  using  the 
following  theorem. 

Theorem  4.1:  Let  P(s)  be  a  nonminimum  phase  plant  satisfying  the  assumptions 
of  [16,  Section  II,  C.l)].  Let  F(s)  be  a  choice  of  feedback  compensator  and 
define  the  sensitivity  function  S(s)  ■  [l  +  P(s)F(s)J  Suppose  that 

(i)  S(s)  has  no  poles  in  Re[s]  >  0, 

(ii)  S(s)  satisfies  Condition  (4.7)  at  the  open  right  half  plane 
zeros  of  P(s)  and,  at  each  right  half  plane  plant  pole,  p,  of  multiplicity  n. 


S(p)  -  0 


ds 


i“l, . . . ,n-l 


s-p 


z  z 

(ill)  F(s)  has  <  Np  -  1  unstable  poles,  where  Np  is  the  total  number 
of  open  right  half  plane  zeros  of  P(s), 


by  (3.11), 


(iv)  a  e  >  0  such  that  inf  |S(s)|  >  e,  where  S(s)  is  defined 

Re[s]  >0 


(v)  3  a >  0,  £  >  0,  and  R>  0  such  that  if  |sj  >  R  and  Re[s]  >  0,  then 

|  S(s)  |  <  a |  s  | 4. 

Conditions  (i)-(v)  are  necessary  and  sufficient  to  Insure  a  weighting  function 
W(s)  exists  which  yields  the  compensator  F(s)  as  the  solution  to  the  weighted 
optimization  problem  (4.10).  The  appropriate  weighting  function  is  given  by 


W(s)  -  cBp(s)BF(s)S-1(s) 


(4.20) 


where  Bp(s)  is  the  Blaschke  product  of  unstable  plant  poles,  Bp(s)  is  the 
Blaschke  product  of  unstable  compensator  poles,  S(s)  is  the  sensitivity 
function  for  the  feedback  F(s),  and  c^0  is  an  arbitrary  constant.  ■ 

Proof :  (Necessity)  Suppose  that  W(s)  is  a  weighting  function  yielding  an 

optimal  conpensator  F  (s)  *  F(s) .  Then  it  follows  from  (4.11)  that 

S*(s)  ■  kBp(s)Bp(s)w”1(s) 

Since  H°°-optimal  solutions  are  internally  stable,  (i)  and  (ii)  are  satisfied. 

k  2 

From  the  fact  that  F  (s)  has  at  most  Np  -  1  unstable  poles,  it  follows  that 
F(s)  must  satisfy  (iii) .  Finally,  from  assumptions  necessary  to  insure  that 
W(s)  is  a  valid  weighting  function  [16],  (iv)  and  (v)  are  satisfied. 
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Sufficiency:  Let  F(s)  be  a  feedback  compensator  with  S(s)  the  corresponding 

sensitivity  function,  and  suppose  that  assumptions  (i)-(v)  are  satisfied. 

Then  W(s), given  by  (4.20),  satisfies  conditions  the  conditions  in  [16]  to  be 

a  valid  weighting  function.  By  assumption  (iii)  the  function  S(s)  =  cBp(s) 

Bp(s)W  1(s) ,  where  c  and  Bp(s)  are  as  in  (4.20),  has  the  appropriate  form 

(4.10).  A  simple  calculation  shows  that  W(z)  *  cB_(z)B_(z)  and  — 7  W  * 

^i  ds  s*z 

c  — 7  B  B_  ,  i=l,...,m-l.  This  follows  from  the  fact  that  S(s)  must  satisfy 

,1  r  t 

ds  s=z 

the  constraints  (4.7).  Thus,  the  function  S(s)  satisfies  these  constraints 
also,  and  by  the  uniqueness  result  of  [16],  S*(s)  -  S(s).  It  then  follows 
that  F  (s)  •  F(s) .  ■ 

Note  that  Conditions  (i)-(ii)  of  Theorem  4.1  are  necessary  for 
internal  stability  while  (iii)-(.v)  will  be  satisfied  by  any  practical  design. 
This  theorem  shows  that  the  class  of  designs  which  may  be  obtained  via  h“ 
methods  is  very  broad  indeed. 

Recall  that  the  application  of  H00  techniques  discussed  in  this 
chapter  involves  iterating  on  the  weighting  function  W^(s)  until  a  satisfactory 
optimal  unweighted  sensitivity  function  is  obtained.  Theorem  4.1  may  now  be 
used  to  show  to  what  extent  this  procedure  is  equivalent  to  that  discussed  in 
Section  4.3  for  design  by  iterating  on  candidate  sensitivity  functions. 

Consider  first  the  case  of  a  plant  with  a  single  (real)  right  half 
plane  zero,  z*x,  and  assume  for  simplicity  that  the  plant  is  stable.  Then, 
the  optimal  unweighted  sensitivity  function  is  related  to  the  weighting  by 
S  (s)  *  W(x)W  1(s) .  Thus  the  sequence  of  weightings  {W^}  used  in  the  H00 
procedure  may  also  be  used  to  construct  candidate  sensitivity  functions 
for  the  direct  sensitivity  design  procedure  by  setting  S.(s)  *  W.^(s).  In 


general,  the  constraint  S . (x)  *  1  will  not  be  satisfied;  however,  as  in 

A  Si(8) 

Section  4.3,  the  candidate  sensitivity  function  S^s)  »  s"~(x)~  satisfy 

S^(x)  s  1.  Suppose,  then,  that  one  may  construct  the  sequence  of  weighting 

functions  so  that  W^(s)  converges  to  a  weighting  for  which  the  function 

Iw^Cjcu)  |  satisfies  the  design  specification  on  sensitivity  sufficiently 

well.  Moreover,  suppose  that  the  corresponding  value  of  W^x)  is  approx- 

* 

imately  one.  Then,  the  optimal  unweighted  sensitivity  function  S^(s)  * 

Wi^'(s)Wi(x)  will  likely  also  satisfy  the  design  goals  sufficiently  well. 

Of  course,  the  sequence  of  candidate  sensitivity  functions  S^(s)  will 

converge  to  this  same  function.  This  follows  easily  from  the  fact  that 

Si (s)  «  w‘1(s)-Wi(x)«S*(s),  since  U 

Obviously,  the  above  procedure  may  be  reversed.  One  may  construct 

a  series  of  candidate  sensitivity  functions  S^s),  as  in  Section  4.3,  and 

then  check  to  see  if  the  constraint  S.(x)  =  1  is  satisfied.  If  not,  then 

S.(s) 

an  internally  stable  design  may  be  obtained  by  scaling:  Si(s)  =  g-  ^  . 

The  function  S^(s)  may  then  be  used  to  construct  a  weighting  function,  as 

in  Theorem  4.1.  Of  course,  this  weighting  function  will  merely  reproduce 

A  A 

the  design  S^(s);  i.e.,  the  weighting  W^(s)  =  (s)  yields  the  optimal 

unweighted  sensitivity  function  S^(s)  =  S^(s).  Again,  the  success  of  this 
procedure  depends  upon  the  ability  to  choose  a  candidate  sensitivity  function 
for  which  |s^(ju)|  satisfies  design  goals  sufficiently  well  and  for  which 
Si(x)*l. 

Thus,  for  the  case  of  a  single  right  half  plane  zero,  design 
by  iterating  on  the  weighting  function  and  by  iterating  on  the  sensitivity 
function  directly  are  equivalent .  The  only  possible  advantage  one  technique 


could  have  over  the  other  is  if  it  provided  more  insight.  However,  the 
problem  faced  by  both  procedures  is  the  same,  and  is  governed  by  the 
integral  relations  (4.8)  and  (4.19  ).  It  seems  that,  in  each  case,  successful 
design  rests  on  being  able  to  gain  insight  from  an  integral  relation. 

If  the  plant  has  more  than  one  right  half  plane  zero  (but  is  still 
assumed  stable  for  simplicity  of  exposition),  then  simply  scaling  the 
candidate  sensitivity  function  will  not,  in  general,  yield  a  new  function 
satisfying  the  constraints  at  all  the  zeros.  Choosing  a  candidate  sensitivity 
function  to  satisfy  constraints  at  more  than  one  zero  as  well  as  some  design 
goals  would  be  difficult  at  best,  despite  any  insights  available  from  the 
integral  relations.  If,  however,  such  a  function  were  found,  then  a  weighting 
constructed  as  in  Theorem  4.1  would  reproduce  this  function  as  the  solution 
to  the  H°°-optimization  problem. 

The  way  in  which  the  H®  approach  deals  with  the  case  of  more  than 
one  plant  zero  is  different  than  the  direct  sensitivity  design  approach. 

The  H°°  approach,  however,  violates  accepted  good  design  practice  by  using 
unstable  poles  in  the  compensator.  These  poles  will  be  present  unless  the 
weighting  function  is  chosen  to  satisfy  the  constraint  (4.17),  which  may 
also  be  expressed  in  terms  of  integral  relations  using  the  lemma  in  Appendix  A. 
A  little  inspection  of  (a)  the  constraints  which  a  candidate  sensitivity 
function  must  satisfy  in  order  to  yield  an  internally  stable  design  and  (b) 
the  constraints  which  a  weighting  function  must  satisfy  in  order  to  yield 
a  stable  compensator  reveals  that  essentially  the  same  problem  is  faced  by 
both  techniques.  Thus,  again,  it  seems  the  two  approaches  are  equivalent 
and  the  success  of  each  hinges  on  the  insights,  or  lack  thereof,  to  be  found 
in  a  set  of  integral  relations. 


98 


The  purpose  of  this  section  has  been  to  point  out  that  the 
H°°-optimization  approach  to  sensitivity  design  is  remarkably  similar  to  the 
method  of  "design  by  direct  choice  of  sensitivity  function."  First, 
essentially  any  design  obtained  by  one  method  may  be  obtained  by  the  other. 
Second,  the  difficulty  faced  by  both  procedures  is  the  same  -  the  fact  that 
one  must  consider  certain  integral  relations  relating  W(joj)  and  W(x)  or 
S(joj)  and  S(x).  The  success  of  either  procedure  seems  to  hinge  on  whatever 
insight  into  these  relations  may  be  gained. 


4.7.  Conclusions 

The  purpose  of  this  chapter  has  been  to  examine  the  relative  merits 
of  two  general  approaches  to  feedback  design.  These  two  approaches  are  design 
employing  open-loop  transfer  functions  and  design  employing  closed-loop 
transfer  functions.  The  design  problem  chosen  as  a  basis  for  comparison  was 
that  of  obtaining  a  feedback  system  whose  sensitivity  function  satisfies 
certain  design  specifications. 

Two  different  closed  loop  based  design  methods  were  compared  in 
this  chapter,  and  found  to  be  substantially  equivalent  when  used  to  address 
the  problem  posed.  In  particular,  both  were  complicated  by  the  fact  that 
the  values  of  a  stable  minimum  phase  transfer  function  at  points  in  the  open 
right  half  plane  are  completely  determined  by  the  values  of  the  function 
along  the  jw-axis.  Even  with  whatever  insights  may  be  gained  from  the 
integral  relations  developed  in  Chapter  3,  this  difficulty  appears  nontrivial. 


The  preceding  observations  suggest  that  open  loop  based  design 
techniques  may  have  some  real  advantages,  since  the  compensator  may  be 
chosen  a  priori  to  be  within  a  given  admissible  class.  This  doesn't 
necessarily  imply  that  open  loop  techniques  are  generally  better  than  closed 
loop  techniques;  they  are  different,  and  must  deal  with  problems  in 
different  ways.  For  example,  using  open  loop  techniques  only  allows  one  to 
evaluate  closed  loop  behavior  indirectly.  Classical  single-loop  design 
methods  utilized  various  rules  of  thumb  to  expedite  this  process.  An 
attempt  to  generalize  some  of  these  rules  of  thumb  to  multiple-loop  systems 
is  discussed  in  Chapter  8.  Again,  as  was  stated  in  the  introduction  to 
this  chapter,  the  purpose  of  the  preceding  discussion  is  not  to  completely 
discourage  development  of  closed  loop  based  design  techniques .  On  the  other 
hand,  the  discussion  has  pointed  out  some  problems  in  which  readily  usable 
information  seems  to  be  lost  in  passing  from  open  to  closed  loop  methods. 
Clarification  of  these  problems  and  related  issues  should  help  in  the 
development  of  any  design  scheme. 

One  final  remark  will  now  be  made.  It  is  possible  to  interpret  an 
iterative  design  process  as  an  exploration  of  the  set  of  possible  designs. 

One  purpose  of  such  an  exploration  is  to  gain  Insight  into  the  properties  that 
this  set  possesses.  (In  particular,  a  distinction  should  be  made  between 
characterizations  of  possible  designs  which  provide  insight  into  the  achieve¬ 
ment  of  design  goals  and  those  characterizations  which  do  not.)  If  such  an 
exploration  is  done  in  conjunction  with  the  development  of  design  specifica¬ 
tions,  then  the  task  of  arriving  at  specif icar-1  cns  which  realistically  reflect 
the  essential  tradeoffs  inherent  in  the  problem  is  facilitated.  One 


conclusion  may  be  that  the  so-called  "trial  and  error"  aspect  of  classical 
design  is  present  for  a  very  good  reason,  and  not  necessarily  as  an  indication 
of  the  lack  of  more  precise  methods.  Another  conclusion  is  that  it  may  be 
unrealistic  to  impose  design  specifications  based  upon  properties  desired 
from  a  design  until  some  idea  of  the  properties  available  is  obtained. 

Finally,  the  viewpoint  of  the  preceding  paragraph  suggests  a 
reasonable  criterion  of  merit  by  which  proposed  feedback  design  methods  may 
be  judged.  This  criterion  is  the  ability  of  the  method  to  efficiently  yield 
insights  into  the  class  of  feedback  designs  possible  for  a  given  plant. 

Of  course,  other  properties  must  also  be  possessed  by  a  technique  in  order 
for  it  to  be  generally  useful. 
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CHAPTER  5 

PREVIOUS  MIMO  RESULTS  AND  MOTIVATION  FOR  PRESENT  WORK 

5.1.  Introduction 

In  Chapter  2  certain  Important  concepts  from  classical  control 
theory  were  discussed.  The  purpose  of  this  chapter  Is  to  discuss  previous 
work  on  the  extension  of  these  concepts  to  MIMO  systems  and  to  point  out 
where  further  work  needs  to  be  done.  Thus  this  chapter  motivates  the  study 
of  some  specific  problems  to  be  proposed  in  the  remainder  of  the  thesis. 

The  discussion  will  at  first  parallel  closely  that  of  Chapter  2.  Portions 
of  this  chapter  have  appeared  in  [29]. 

5.2.  Basic  Equations  for  MIMO  Feedback  Systems 

Consider  a  linear  time-invariant  multiple-input  multiple-output 
(MIMO)  feedback  system  as  shown  in  Figure  5.1.  This  feedback  structure  is 
the  same  as  that  of  Figure  2.1,  for  SISO  systems,  with  the  exception  that 
all  transfer  functions  are  matrices  and  all  signals  are  vectors.  For 
simplicity,  only  square  transfer  function  matrices  are  considered;  such  cases 
are  sufficient  to  illustrate  the  ideas  discussed  in  this  thesis.  Specifically, 
P(s)  and  F(s)  are  the  transfer  functions  of  the  plant  model  and  feedback 
compensator,  respectively,  and  take  values  in  I11  n.  The  signal  r(s)  is  the 
reference  input;  y(s),  the  system  output;  d(s),  a  disturbance  input;  and  n(s) , 
sensor  noise.  All  signals  take  values  in  Cn. 

The  output  of  the  feedback  system  is  again  given  by 

y(s)  -  yr(s)  +  yd(s)  +  yn(s)  (5.1) 

yr (s)  *  [I  +  P(s)F(s)]-1  P(s)r(s) 


(5.2) 
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yd(s)  -  [I  +  P(s)F(s)r  d(s) 


(5.3) 


yn(s)  ■  -  [I  +  P(s)F(s)]"-l  P(s)F(s)n(s) 


(5.4) 


Define  the  open  loop  transfer  function 


L(s)  =  PCs)F(s) 


(5.5) 


the  system  sensitivity  function 


S(s)  -  (I  +  L(s) ]_1 


(5.6) 


and  the  complementary  sensitivity  function  [2] 


T(s)  -  [I  +  L(s) ]_1  L(s) 


(5.7) 


Note  the  transfer  functions,  (5-5)  —  (5.7),  are  obtained  by  breaking  the  loop 
at  the  output  of  the  plant  (denoted  by  (xx)  in  Figure  5.1).  Breaking  the 
loop  at  the  plant  input  yields  a  generally  different  set  of  transfer  functions 
due  to  noncommutativity  of  matrix  multiplication.  These  transfer  functions 
are  obtained  by  reversing  the  order  of  P(s)  and  F(s)  in  (5.5)— (5.7) . 
Implications  of  this  lack  of  commutativity  will  be  discussed  shortly. 

For  later  reference,  define 


L1(s)  «  FCs)P(s) 

S1(s)  ■  [I  +  LjU)]"1 
T^s)  -  [I  +  L1(s)]"1  Lx(s) 


(5.8) 


(5.9) 


(5.10) 


Assume  that  L(s)  and  L^(s)  are  free  of  unstable  hidden  modes.  Then  the 
feedback  system  is  stable  if  S(s)  (equivalently  S^(s))  is  bounded  in  the 
closed  right  half  plane.  Note  the  assumption  on  L(s)  and  L^(s)  implies,  as 
in  the  SISO  case,  that  right  half  plane  poles  and  zeros  of  the  plant  and 


compensator  must  appear  with  at  least  the  same  multiplicity  in  L(s)  and 
L^(s).  In  addition,  this  assumption  imposes  conditions  on  the  left  and 
right  null  spaces  of  L(s)  and  L^(s)  evaluated  at  right  half  plane  plant 
and  compensator  zeros.  A  similar  condition  holds  for  open  right  half  plane 
poles . 

One  difference  from  SISO  systems  is  that  vector  and  matrix  norms 
must  be  used  to  measure  the  magnitudes  of  signals  and  transfer  functions. 

In  this  thesis  attention  will  be  restricted  to  the  standard  Euclidean  vector 
norm  and  the  induced  matrix  two-norm.  This  matrix  norm  is  also  referred  to 
as  the  singular  value  norm;  a  discussion  of  relevant  properties  of  singular 
values  is  contained  in  Appendix  B. 

From  (5.3)  and  (5.6)  it  follows  that  the  response  of  the  system  to 
disturbance  inputs  is  determined  by  the  sensitivity  function  with  the  loop 
broken  at  the  plant  output.  Since 

a[S(jui)]  -  max  jS(ju>)u|_  (5.11) 

u€l"  * 


it  follows  that  the  response  of  the  system  to  disturbances  at  frequency  ui  can 
be  made  small  by  requiring  that  o[S  (ja>)  ]<<1.  Note  (5.11)  implies  that  the 
response  to  a  disturbance  will  be  small  independently  of  the  subspace  of  ffn 
in  which  it  lies.  Similarly,  the  response  of  the  system  sensor  noise  is 
determined  by  T(s),  and  can  be  made  small  by  requiring  a [T(juO  ]<<1. 

The  benefits  of  feedback  in  reducing  the  effects  of  uncertainty 
in  the  plant  model  upon  the  system  output  can  be  assessed  from  [3] 


K 


Ec(s)  -  S'(s)Eo(s)  (5.12) 

S ' (s)  -  [I  +  PXs)F(s) ]_1  (5.13) 

where  P'(s)  -  [I  +  A(s)]P(s)  is  the  true  plant.  The  signals  Ec(s)  and  Eq(s) 
are  the  deviations  in  the  outputs  of  nominally  equivalent  open  and  closed 
loop  systems  caused  by  the  model  error  A(s).  If  at  some  frequency 

o [S'(jca)  ]  <  1  ,  (5.14) 

then  the  closed  loop  system  is  said  to  possess  the  sensitivity  reduction 
property  at  that  frequency.  Condition  (5.14)  is  difficult  to  design  for 
since  it  depends  on  the  true  unknown  plant  rather  than  the  model.  Nonetheless 
at  frequencies  for  which  a[A(juO]  <1  it  is  possible  [4]  to  insure  that  (5.14) 
holds  by  requiring  the  nominal  sensitivity  function  to  have  small  magnitude. 
(Although  this  approach  may  fail  dramatically  [10]  when  the  uncertainty 
cannot  be  assumed  arbitrarily  small,  or  when  uncertainty  is  present  which 
cannot  be  modelled  as  above . ) 

The  sensitivity  and  complementary  sensitivity  functions  are  also 
related  to  stability  robustness  properties  of  the  system  (see,  e.g.  [5], 

Table  1) .  Suppose  that  uncertainty  is  modelled  as  occurring  at  the  output 
of  the  plant: 

P’(s)  *  [I  +  Ao(s)]P(s)  .  '  (5.15) 

Assume  that 

(i)  P'(s)  and  P(s)  have  the  same  number  of  unstable  poles  and 

(ii)  A  (s)  is  otherwise  arbitrary  subject  to  a  frequency 


dependent  norm  bound 


CIA  (ju))j  <  Ma(o»)  Vw 


(5.16) 


Then  if  the  nominal  system  (Aq(s)  =  0)  is  stable,  the  true  system  is 
guaranteed  to  be  stable  if  [4] 


Ma(u>)  < 


a[T(ju>)] 


(5.17) 


This  bound  requires  that  a[T(joj)3  be  small  at  frequencies  for  which  uncertainty 
modelled  in  (5.15),  is  large. 

An  important  difference  between  SISO  and  MIMO  systems  is  the  fact 
that  stability  margins  in  the  latter  are  generally  different  at  different 
points  in  the  feedback  loop.  Mathematically  this  occurs  because  matrix 
multiplication  is  not  in  general  commutative,  as  was  pointed  out  in  the 
above  discussion  of  transfer  functions  obtained  by  breaking  the  loop  at 
different  points  (Eqs.  C5 .5)— (5.7)  compared  to  (Eqs.  (5 . 8)-(5 . 10) ) . 

To  illustrate,  suppose  that  uncertainty  is  modelled  as  occurring  at  the 
plant  input 


P’(s)  -  PCs) [I  +  4  (s)] 


(5.18) 


rather  than  at  the  plant  output  as  in  (5.15).  Under  assumptions  analogous 
to  (i)  and  (ii) ,  if  the  nominal  system  is  stable,  the  true  system  is 
guaranteed  to  be  stable  if  [4] 


Ma(co)  < 


o[T,(ju))] 


(5.19) 


where  T^(s)  is  given  by  (5.10).  Thus,  MIMO  systems  can  have  small  stability 
margins  in  one  loop  location  despite  having  good  margins  elsewhere.  Moreover, 
when  uncertainty  is  present  at  both  the  input  and  the  output  to  the  plant,  the 
system  can  be  close  to  instability  despite  good  margins  against  uncertainty 
occurring  at  each  point  separately.  Such  phenomena  are  examples  of  MIMO 
system  behavior  which  has  no  analogue  in  SISO  systems.  Examples  are  found 
in  [10].  A  framework  for  analyzing  robustness  against  uncertainty  occurring 
in  different  loop  locations  has  been  developed  by  Doyle  [30]  and  Doyle,  Wall 
and  Stein  [5]. 


Special  Cases  in  which  SISO  Concepts  Generalize  to  MIMO  Systems 


The  above  discussion  has  shown  that  the  first  concept  from  SISO 
theory  discussed  in  Chapter  2  generalizes  readily  to  MIMO  systems.  Closed 
loop  properties  can  be  determined  by  examining  certain  closed  loop  transfer 
function  matrices.  This  point  of  view  is  stressed  in  [4,11-14]. 

Differences  are  that  more  matrices  must  be  considered,  including 
some  nonstandard  ones  [5,30]  when  uncertainty  is  present  at  more  than  one 
location  in  the  loop.  Attention  in  this  thesis  will  be  restricted  to 
uncertainty  modelled  as  occurring  at  only  one  location  in  the  loop  so  that 
matrices  such  as  (5.6)-(5.7)  or  (5. 9) -(5. 10)  suffice  to  determine  feedback 
properties . 

The  second  theme  from  SISO  feedback  theory  is  that  the  relation 
between  open  and  closed  loop  system  properties  can  be  approximated  fairly 
well.  Under  certain  conditions  this  relation  extends  to  MIMO  systems. 

First  some  terminology  will  be  introduced. 


[IK] 


From  the  definition  of  singular  values  of  a  matrix  M=  C 


max  i 
u€Cn  ' 

I u!  2  “  1 


u6  Cn 

Iu!  2  “  1 


Equations  (5.20)  and  (5.21)  motivate  the  following  terminology.  When  M  is 
a  transfer  function  matrix,  M  =  M(s) ,  evaluated  at  some  frequency  w,  then 
the  gain  of  the  underlying  system  is  said  to  be  large  in  all  directions  at 
that  frequency,  provided  that  o[M(ju))  ]>>1.  Similarly,  the  gain  is  said  to  be 
small  in  all  directions,  provided  that  a(M(jw) ]<<1.  If  at  some  frequency 
o  [M(joj)  ]  <<l<<a  [M(ju)  ],  then  the  system  has  both  large  and  small  gain  (in 
different  directions)  at  that  frequency.  Use  of  singular  values  to 
generalize  the  notion  of  gain  has  been  discussed  by  several  authors;  an 
early  reference  is  MacFarlane  and  Scott-Jones  [31]. 

From  (5.6)  and  (5.7)  the  following  relations  between  levels  of 
open  loop  gain  and  closed  loop  properties  can  be  deduced: 


o[L(jw)  ]>>1 


a [S ( jw) ] <<1 


T(jw)  58  I 


a[T(ju>)  ]<<1 


a[L(ju)  ]<<1 


S (joj)  88  I 


Thus,  at  frequencies  for  which  open  loop  gain  is  either  large  in  all  directions 
or  small  in  all  directions,  the  SISO  rules  of  thumb  (2.19)  and  (2.20)  for 
relating  open  to  closed  loop  properties  generalize  directly.  When  neither  of 
these  assumptions  on  the  gain  of  L(joi)  hold,  then  previous  results  only  suffice 
to  relate  open  to  closed  loop  properties  in  special  cases,  e.g.,  diagonal 
systems . 

Another  important  concept  from  classical  feedback  theory  was  the 
existence  of  tradeoffs  among  various  design  goals.  These  tradeoffs  are  also 
present  in  MIMO  systems  although  their  exact  nature  is  much  more  complicated. 
Known  generalizations  of  analytic  tradeoffs  among  system  properties  in 
different  frequency  ranges  will  be  described  first. 

Tradeoffs  imposed  by  open  right  half  plane  poles  and  zeros  upon  the 
sensitivity  and  complementary  sensitivity  functions  were  described  briefly 
in  Chapter  3.  An  advantage  of  these  results  was  that  they  showed  that  open 
right  half  plane  poles  and  zeros  affect  feedback  properties  only  in  certain 
directions.  A  generalization  (due  to  Doug  Looze)  of  Bode's  integral  theorem 
(Theorems  2.1  and  3.3)  is  now  given. 

Theorem  5.1:  Assume  that  the  open  loop  transfer  function  matrix  L(s)  possesses 
finitely  many  open  right  half  plane  poles  { p^  ;  i*l , . . . ,Np }  including 
multiplicities.  In  addition,  assume  that  each  element  (s) ;i, j“l, . . .  ,n 
satisfies 

lim  sup  R|L  .  (s)  |  *  0  .  (5.24) 

R-*>»  j  s  |  >R 
Re[s]>0 

Then,  if  the  closed  loop  system  is  stable,  the  determinant  of  the  sensitivity 
function  must  satisfy 
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tt  E  Re[p.  ]  -  /  log  j  det  [S  (joj)  ]  |  da) 
1-1  0 


(5.25) 


Proof :  See  Appendix  C.  ■ 

Corollary  5.2;  Assume  the  hypothesis  of  Theorem  5.1  is  satisfied.  Then, 
the  singular  values  of  the  sensitivity  function  must  satisfy 


p  n  00 

it  E  Re[p.  ]  -  E  /  logo ,  [S(jo>)  ]dw  -  0  .  (5.26) 

i-1  i-1  0 


Proof :  See  Appendix  C.  ■ 

Corollary  5.2  shows  that  a  tradeoff  between  sensitivity  reduction 
and  sensitivity  increase  exists  for  MIMO  systems  as  well  as  SISO  systems. 

The  difference  is  that  the  tradeoff  is  averaged  over  all  singular  values  of 
S(joi) .  Presumably  sensitivity  properties  could  be  traded  off  among  different 
directions  as  well  as  among  different  frequency  ranges,  although  no  insight 
is  given  into  how  this  might  be  achieved.  Note,  however,  that  if 

o^ [S (joj)  ]  =  aj[S(j<j)]  Vi#j  and  Vu>  ,  (5.27) 

then  the  tradeoff  for  open  loop  stable  systems  is  exactly  the  same  as  in 
the  SISO  case.  For  open  loop  unstable  systems  the  additional  penalty  due 
to  the  right  half  plane  poles  of  L(s)  can  apparently  be  averaged  over  all 
loops  of  the  system  or  can  be  concentrated  more  or  less  in  some  loops. 

Again,  no  insight  into  how  to  achieve  this  is  available. 


vv 
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In  summary.  Corollary  5.2  indicates  that  tradeoffs  analogous  to 
those  found  in  SISO  systems  are  present,  but  that  they  can  be  made  between 
different  directions  as  well  as  between  frequency  ranges.  When  the  behavior 
of  the  feedback  system  is  assumed  to  be  the  same  in  all  directions,  (i.e., 
(5.27)  holds)  the  tradeoffs  reduce  to  those  found  in  SISO  systems,  with  the 
exception  noted  when  the  system  is  open  loop  unstable.  As  pointed  out, 
little  is  known  about  the  general  case. 

The  Bode  gain-phase  relations  (Theorem  2.2)  have  also  been  studied 
in  the  context  of  MIMO  systems  (Doyle  and  Stein  [4]),  although  again  the 
results  are  primarily  useful  for  systems  which  have  nearly  identical  properties 
in  all  directions.  Doyle's  results  utilize  the  fact  that  eigenvalues  and 
singular  values  satisfy  the  inequality 


o[M]  <  | A  [M] |  <  a[M]  ,  Vi 


(5.28) 


Thus,  if  one  can  show  that  |  [S(joj)  ]  j  >>1  for  some  i,  then  it  follows  that 


a[S(j(u)  ]>>1.  Since 


XiCSCja,)]  *  l+A1fL(j«u)] 


(5.29) 


one  can  study  the  relation  between  gain  and  phase  of  the  eigenvalues  of 
L(jw)  near  crossover  to  see  if  tradeoffs  such  as  those  imposed  by  the  SISO 
gain-phase  relations  are  present. 


A  complete  analysis  is  given  in  [4,  pp.  9-10];  the  conclusion  is 


that  the  relation  between 


d  log|Ai(jw) 


and  (jto)  is  the  same  as  for  a 


SISO  transfer  function  if  A^(s)  has  no  right  half  plane  branch  points. 


If  X^(s)  has  branch  points,  then  the  relation  holds  averaged  over  all  the 
eigenvalues  defined  on  the  same  Riemann  surface.  These  relations  are  most 
informative  about  feedback  properties  in  the  special  case  that 
o[L(ju>)]  =  |^[L(ju>)]|  =  a[L(jai)]  in  the  vicinity  of  crossover.  If  this 
condition  holds,  then  the  tradeoff  is  the  same  as  that  for  SISO  systems. 

When  there  exist  different  levels  of  gain  in  different  directions,  then 
generally 

o[L(joj)]  >  |  A±  [L(ju»)  ]  |  >  a  [L(jua)  ]  Vi  (5 . 3 

and  the  eigenvalue  magnitudes  need  not  even  closely  approximate  the  singular 
values.  Thus,  the  gain  in  some  loops  could  decrease  rapidly  near  crossover 
with  no  corresponding  rapid  decrease  in  the  |  X^[L(jai)  ]  |  .  No  insight  into 
this  situation  is  provided,  however. 

The  other  class  of  tradeoffs  found  in  SISO  systems  was  algebraic . 
Those  tradeoffs  exist  between  system  properties  at  any  given  frequency  and 
follow  from  an  identity  which  generalizes  to  MIMO  systems  as 

Sts)  +  T(s)  =  I  .  (5.3: 

The  algebraic  tradeoffs  discussed  in  Chapter  2  extend  to  MIMO  systems 
in  the  following  manner.  If  good  sensitivity  reduction  and  disturbance 
rejection  properties  are  present  in  all  directions  at  a  given  frequency 
(a[S(j<i>)  ]«1)  ,  then  from  (5.31),  good  sensor  noise  rejection  and  large 
stability  margins  cannot  be  present  in  any  direction  (T(s)ssI).  A  similar 
statement  holds  if  a[T(jw)]<<l. 


In  summary,  the  relation  between  open  and  closed  loop  system 
properties,  and  the  analysis  of  tradeoffs  among  system  properties  extends 
readily  to  MIMO  systems  only  when  these  properties  are  constrained  to  be 
approximately  the  same  in  all  directions.  These  cases  may  be  viewed  merely 
as  special  instances  for  which  no  uniquely  MIMO  system  properties  are 
present.  The  usefulness  of  these  results  will  be  limited  to  cases  for 
which 

(i)  the  plant  has  approximately  equal  bandwidth  in  all 
directions  or  loops, 

(ii)  only  unstructured  uncertainty,  sensor  noise,  and 
disturbance  inputs  are  present;  that  is,  the  magnitudes  of  these  quantities 
are  approximately  the  same  in  all  directions. 

Condition  (i)  will  hold  whenever  a[P(jcu)  ]  3  a[P(jw)]  Vtu.  Now, 
recall  that  the  classical  design  methods  focussed  on  modifying  open  loop 
properties  via  a  suitable  choice  of  compensation  to  achieve  the  desired 
closed  loop  properties.  Obviously,  this  process  is  expedited  if  insight 
into  the  relation  between  open  and  closed  loop  properties  is  available. 

If  Condition  (i)  fails,  in  particular,  if  the  open  loop  plant  has  both 
large  and  small  gain  in  some  frequency  range,  then  design  will  be  complicated 
by  lack,  of  necessary  insight. 

Condition  (ii)  holds  for  uncertainty,  as  modelled  in  (5.16).  This 
condition  will  also  hold  for  disturbance  inputs  and  sensor  noise  which  are 
arbitrary,  subject  only  to  frequency  dependent  norm  bounds: 


|d(Jw)|2  <  Md(u»)  Vw  (5.32) 

and 

|n(jm)|2  <  Mn(u>)  Vu>  .  (5.33) 

If  Condition  (ii)  fails  to  hold,  then  one  is  led  to  consider 
design  goals  on  S(s)  and  T(s)  which  specify  different  properties  in 
different  loops.  To  see  this,  some  simple  models  for  structured  uncertainty, 
sensor  noise,  and  disturbance  inputs  are  considered  in  the  next  section. 


5.4.  Structure  in  MIMO  Systems  and  Failure  of  SISO  Generalizations 

In  MIMO  systems  it  is  reasonable  to  expect  that  levels  of  disturbance 
inputs  and/or  sensor  noise  may  be  significantly  different  in  dif fe-^r t 
directions  over  the  same  frequency  range.  To  illustrate,  suppose  that  sensors 
in  some  loops  are  noisy  at  a  lower  frequency  than  sensors  in  other  loops, 
while  the  level  of  disturbance  inputs  is  approximately  the  same  in  all  loops 
at  any  given  frequency.  This  situation  can  be  modelled  mathematically  by 
assuming  the  sensor  noise  can  be  separated  into  two  components : 


n(s)  -  ns  (s)  +  nu(s) 


(5.34) 


The  component  ng(s)  lies  in  a  k-dimensional  subspace  Ng  of  and  represents 
structured  sensor  noise.  The  component  nu(s)  represents  an  additional 
unstructured  component,  and  is  assumed  to  lie  in  Cn.  Suppose  that  the 
levels  of  the  two  components  of  sensor  noise  and  the  level  of  disturbance 
inputs  are  bounded  as 


,*V 


u 


Vw 


!  ns  (Ju)  I  2  -  Ms(u,) 


(5.35) 


|d(ju>)|2  <  Md(w) 


(5.37) 
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i)  Vu 
Vu> 

Next,  suppose  that  the  ratios  of  the  bounds  on  the  two  components  of  sensor 

noise  compared  to  the  bound  on  the  disturbance  Inputs  is  as  shown  in 

Figure  5.2.  This  figure  indicates  that  the  structured  component  of  the 

sensor  noise  becomes  relatively  larger  at  a  lower  frequency  than  does  the 

unstructured  component.  Thus,  the  following  design  specification  on  S(s) 

and  T(s)  is  motivated.  First,  let  Pg  be  an  orthogonal  projection  [32] 

onto  the  subspace  N  of  Cn.  Then,  *  I  -  P  is  an  orthogonal  projection 

s  s  s 

onto  S‘L,  the  orthogonal  complement  of  X  .  The  design  specification  is 
s  s 

thus : 


o [ S ( j to)  ]«1 

0)  <  U)^ 

(5.38a) 

o[T(jw)P  ]«1 

s 

u>2  <  a>  < 

(5.38b) 

a[S(jw)PA]«l 

s 

0)^  <  U)  <  (1)3 

(5.38c) 

a[T(jaj)]«l 

e 

■t- 

1  A 

e 

(5 . 38d) 

This  specification  provides  for  disturbance  rejection  in  all 
directions  in  the  low  frequency  range  [0,u^]  for  which  disturbances  dominate 
the  sensor  noise  in  all  directions.  At  frequencies  greater  than  sensor 
noise  in  the  subspace  Ng  should  be  rejected,  since,  the  level  of  sensor 
noise  dominates  the  level  of  disturbance  inputs  at  these  frequencies.  On  the 
other  hand,  disturbances  in  the  subspace  should  be  rejected  at  frequencies 
less  than  This  follows  since  the  level  of  disturbances  in  this  subspace 


still  dominates  sensor  noise  levels  at  these  frequencies.  Finally,  at 
frequencies  greater  than  tu,  ,  sensor  noise  should  be  rejected  in  all  directions. 


Figure  5.2.  Relative  levels  of  sensor  noise  to  disturbance  inputs. 


Thus  in  this  example  the  physical  nature  of  the  extraneous  inputs  dictated 
the  design  of  a  system  with  significantly  different  properties  in  different 
directions  over  the  frequency  range 

Design  specifications  such  as  that  just  discussed  can  also  arise 
when  structured  uncertainty  is  present .  To  see  this ,  consider  uncertainty 
modelled  as  occurring  at  the  plant  output  as  in  (5.15)  with  4fl(s)StnXn 
given  by 

A  (s)  -  a(s)  YX11  +  A  (s)  .  (5.3! 

o  r 

H 

Here,  YX  is  a  constant  matrix  of  rank  k  whose  k  nonzero  singular  values 
are  equal  to  one,  a(s)  is  a  scalar  transfer  function  satisfying 

| a(joi)  |  <  Ma(oi)  Vai  (5.4< 

and  A^ (s)  represents  residual  unstructured  uncertainty  with 

a[Ar(joj)]  <  Mr(w)  Vco  .  (5.4! 

The  first  term  in  (5.39)  represents  structured  uncertainty  which  becomes 

large  at  lower  frequencies  than  the  remaining  unstructured  component.  For 

example,  since  the  uncertainty  is  modelled  as  occurring  at  the  plant  output, 

this  might  represent  unmodelled  sensor  dynamics  in  the  ith  loop  (in  which 
H 

case  YX  =  diag[0, .. .0,1,0, .. .0]  with  the  one  in  the  ith  position). 

A  similar  model  using  uncertainty  occurring  at  the  plant  input,  as  in  (5.18), 
could  be  used  to  represent  unmodelled  actuator  dynamics. 

The  proof  of  the  stability  robustness  theorem  in  [4,  p.  7]  can  be 
followed  to  show  that  stability  is  guaranteed, provided  that 


(5.4 


o[Ao(jw)T(Jw)]<l  Vo) 

At  frequencies  for  which  Ag(juj)  ct(ju))YX**,  it  follows  that 

M  (u>)  <  - ^ -  (5.4 

^[YX*1  T(Jw)] 

must  hold.  Suppose  that  M  (w) >>1, and  let  P„  be  an  orthogonal  projection 

Cl  A 

onto  J?x,  the  column  space  of  X^.  It  follows  that  (5.43)  implies 

T(j«)]«l  •  (5.4' 

Thus,  T(jcu)  must  be  small,  but  only  in  a  k-dimensional  subspace  of  Cn. 
Specifically,  the  range  of  T(jui)  must  be  restricted  to  the  subspace  the 
null  space  of  the  structured  uncertainty. 

From  (5.31)  it  follows  that  (5.44)  implies 

PxS(jui)«Px  .  (5.4! 

Thus,  S(juj)  is  approximately  the  identity  on  a  k-dimensional  subspace  of 
*n.  Although  the  subspace  containing  disturbances  which  cannot  be  rejected 
is  not  specified, the  response  to  these  disturbances  is  constrained  to  lie  in 
the  subspace  f^.  As  in  the  discussion  of  structured  sensor  noise,  S(jw)  can 
still  reject  disturbances  lying  in  an  (n  -  k) -dimensional  subspace. 


The  above  discussion  has  pointed  out  that  there  exist  many  cases 
of  interest  for  which  the  closed  loop  system  has  significantly  different 
properties  in  different  directions  over  some  frequency  range.  It  was  also 
pointed  out  that  several  important  classical  concepts  have  not  been  extended 
to  MIMO  systems  which  possess  such  properties.  One  such  concept  is  an 
understanding  of  the  relation  between  open  and  closed  loop  system  properties. 
Such  understanding  is  necessary  to  shape  the  open  loop  transfer  function  via 
the  choice  of  feedback  compensator.  Another  such  concept  is  the  statement  of 
various  tradeoffs  among  system  properties  in  different  frequency  ranges. 
Precise  statements  of  these  tradeoffs  are  necessary  to  assess  whether  a  given 
design  specification  is  achievable  by  a  linear  time-invariant  design.  Both 
these  questions  will  be  addressed,  although  not  solved  completely,  in  the 
remainder  of  this  thesis.  First,  the  role  of  phase  in  MIMO  systems  will  be 


discussed. 
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CHAPTER  6 

GAIN,  PHASE,  AND  DIRECTIONS  IN  MIMO  SYSTEMS 
(PHYSICAL  INTERPRETATIONS) 


6.1.  Introduction 

The  purpose  of  this  chapter  Is  to  discuss  generalizations  of  the 
classical  SISO  system  concepts  of  gain  and  phase  to  MIMO  systems.  In  addition, 
the  concept  of  direction  of  a  vector  signal  will  be  discussed.  Directionality 
has,  of  course,  no  analogue  in  SISO  system  theory. 

First,  the  physical  meaning  of  a  transfer  function  is  discussed  as 
it  relates  to  the  steady-state  response  of  a  linear  time-invariant  system  to 
sinusoidal  inputs.  Next,  the  meaning  of  the  direction  of  a  vector  signal  is 
discussed,  along  with  a  measure  of  distance  between  signals  lying  in  different 
directions.  The  concept  of  the  gain  of  an  SISO  transfer  function  is  discussed, 
along  with  the  generalization  to  MIMO  systems.  Although  the  use  of  the 
singular  value  decomposition  (SVD)  to  exhibit  a  canonical  set  of  gains  is 
well-known  (see,  e.g.,  [4], [31])  this  idea  is  reviewed  for  completeness. 

Generalization  of  the  concept  of  phase  to  MIMO  systems  has  not  been 
as  straightforward.  On  the  one  hand,  the  importance  of  the  phase  concept 
has  been  called  into  question.  In  particular,  it  has  been  shown  (e.g.,  [1] 
and  the  discussion  in  Section  2.1)  that  the  concept  of  phase  margin  is  not 
of  primary  importance  even  in  the  single  loop  case.  On  the  other  hand, 
several  different  definitions  of  phase  have  been  proposed  by  MacFarlane  and 
co-workers  ([14], [33]).  Given  these  observations,  perhaps  the  most  accurate 
assessment  is  that  a  generally  accepted  concept  of  phase  in  multiple  loop 
systems  is  not  yet  available.  Rather  than  attempt  such  a  definition,  in  this 


chapter,  some  physical  interpretations  of  phase  in  single  loop  system  are  first 


explored.  Multiple  loop  systems  are  then  examined  to  see  if  similar  inter¬ 
pretations  are  available  in  the  general  case.  The  approach  which  will  be 
taken  is  based  upon  the  way  in  which  vector-valued  sinusoidal  signals  interact 
when  added  together.  Portions  of  this  chapter  will  appear  in  [34]. 


6.2.  Properties  of  Transfer  Functions 

The  response  of  a  linear  time  invariant  system  to  an  input  is  a 
function  of  both  the  input  and  the  initial  conditions.  If  these  initial 
conditions  are  equal  to  zero,  then  the  response  may  be  calculated  from  the 
transfer  function  of  the  system.  Let  this  transfer  function  be  denoted  by 
L(s) €  Cnxn  and  consider  inputs  of  the  form 

u(t)  -  keSot  (6.1) 

where  k€<cn  is  a  constant  vector  and  sq  =»  xq  +  jyQ  is  the  complex  frequency 
of  the  input.  Entries  of  k  can,  of  course,  be  complex;  note,  however,  that 
(6.1)  can  be  rewritten 

(6.2) 


where  each  r^  >  0  and  different  values  of  imply  phase  differences  between 


u(t) 


r  es°t+*l 
1 

r  esot+*2 
r2e 


r  eSot+0n 
n 


the  components  of  the  sinusoidal  signal. 


The  Laplace  transform  of  u(t)  is  given  by 


u(s)  *  k  ~~  .  (6.3) 

S*“S 

o 

The  frequency  response  of  the  system,  given  the  input  u(s) ,  is 

y (s)  *  L(s)u(s)  .  (6.4) 

The  corresponding  time  response  will,  in  general,  have  contributions  from 
the  poles  of  L(s)  as  well  as  the  pole  s *  sq  of  the  input.  For  stable  systems 
the  former  contributions  go  to  zero  in  the  steady  state  and  the  time  response 
is  given  by 

y(t)  *  L(sQ)u(t)  .  (6.5) 

In  fact  it  can  be  shown  [35]  that  the  input  u(t)  can  be  augmented  with  a 
set  of  delta  functions  and  derivatives  of  delta  functions  so  that  the  steady 
state  is  reached  at  t  *  0+.  Thus,  (6.5)  illustrates  the  manner  in  which 
properties  of  transfer  functions  are  related  to  the  time  responses  of  the 
system  to  complex  exponential  inputs. 


6.3.  Directions  of  Vector  Signals 

From  (6.4)  and  (6.5)  one  can  see  that  the  action  of  a  transfer 
function  matrix  upon  an  input  (6.1)  is  solely  a  function  of  the  (one¬ 
dimensional)  subspace  of  Cn  spanned  by  k.  That  is,  choosing  an  input 
u(t)  =  c  u(.t) ,  with  c  a  complex  scalar,  yields  an  output  y(t)  =  c  y(t)  . 

Since  linear  systems  are  being  studied,  this  is  an  obvious  remark;  however, 
this  is  the  first  manifestation  of  an  idea  which  will  recur  throughout  the 
remainder  of  this  thesis.  Namely,  given  a  one-dimensional  subspace  of  C° 
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there  exist  two  degrees  of  freedom  in  specifying  a  vector  within  that 
subspace.  One  degree  of  freedom  corresponds  to  the  magnitude  of  the  vector; 
the  other  is  analogous  to  phase.  Many  system  properties  of  interest  depend 
only  on  the  subspace  in  which  a  signal  lies.  For  now,  discussion  will  be 
limited  to  making  precise  the  notion  of  the  direction  of  a  vector  signal. 

The  set  of  directions  in  ®n  will  be  defined  as  the  set  of  one¬ 
dimensional  subspaces  of  Cn.  The  following  notation  is  convenient:  Given 
a  nonzero  vector  in  In  denoted  by  a  small  letter,  e.g.,  k,  then  the 
one-dimensional  subspace  of  ®n  spanned  by  this  vector  will  be  denoted  by  a 
capital  script  letter,  e.g.,  K.  (This  convention  will  later  be  extended  to 
subspaces  of  dimension  greater  than  one.)  Thus,  the  direction  of  a  signal 
is  given  by  the  one-dimensional  subspace  of  ®n  which  it  spans. 


A  method  of  measuring  the  distance  between  directions,  or  one¬ 
dimensional  subspaces  of  Cn,  will  be  needed.  It  is  standard  [36  ]  to  define 
the  angle  <j>  between  two  one-dimensional  subspaces  and  C ^  of  Cn  from 


cos 


(6.6) 


where  c^  is  an  arbitrary  nonzero  vector  in  Note  that  0  <  <  y.  Thus 

)  is  a  measure  of  angular  distance  between  the  subspaces  and  C^.  For 
$*0,  the  two  subspaces  are  identical;  for  <p  =  tt/2,  the  two  subspaces  are 
orthogonal.  The  concept  of  angles  between  subspaces  of  Cn  can  also  be 
extended  to  subspaces  of  dimension  greater  than  one  [36]. 
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6.4.  Gain  in  SISO  and  MIMO  Systems 


For  SISO  systems  the  gain  is  a  frequency-dependent  function  which 
gives  the  ratio  of  the  magnitude  of  the  system  output  to  that  of  the  input. 
Thus,  for  scalar  inputs  of  the  form  (6.1): 

8ain  =  lyfc?  1 

8  ln  |u(t) I 


|L(sQ)u(t) 


|L(so) 


(6.7] 


For  MIMO  systems,  gain  is  a  function  of  the  direction  of  the  input  as  well 
as  the  frequency.  Thus,  for  a  vector  input  of  the  form  (6.1): 


lu(t)l 

iL(so)k» 


(6.8) 


where  the  standard  Euclidean  vector  norm  is  used.  In  view  of  the  directional 
dependence  of  the  gain  it  is  only  natural  to  ask  which  directions  yield  the 
largest  and  smallest  values  of  gain.  This  question  is  answered  using  the 
singular  value  decomposition  (SVD)  [37-39 ] •  This  canonical  form  of  a  matrix 
is  given  by  (dropping  the  functional  dependence  on  s  ) : 


L  -  VEU 


(6.9) 


A  complete  discussion  of  the  SVD  is  contained  in  Appendix  B.  A  summary  is 
given  here  for  the  case  in  which  the  singular  values  are  distinct.  The 

singular  values  of  L  are  given  by  the  nonnegative  square  roots  of  the  eigen- 

H  n  *n 

values  of  L  L.  It  is  assumed  here  that  L(sq) £  X  :  thus,  these  eigenvalues 

H  H 

are  identical  to  those  of  LL  .  (The  symbol  L  is  read  "L-Hermitian"  and  is 
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the  complex  conjugate  of  the  transpose  of  L.)  The  right  singular  vectors  of 
L  are  the  columns  of  U  and  are  the  eigenvectors  of  A.  The  left  singular 
vectors  of  L  are  the  columns  of  V  and  are  the  eigenvectors  of  LL  .  Assuming 
that  the  singular  values  are  distinct,  the  right  singular  vectors  are  unique 
up  to  multiplication  by  a  unit  magnitude  constant  eJ  .  The  left  singular 


vectors  are  then  uniquely  determined  from  the  relation 

Lu^,  “ 


(6.10) 


Note  that  choosing  a  different  right  singular  vector 


ia 

ui  "  e  ui 


(6.11) 


implies  that  the  new  left  singular  vector  is  given  by 


*  ja 
V  3  v 

i  i 


(6.12) 


It  is  common  to  order  the  singular  values  so  that 


-A  A 

,  a  m  a 


(6.13) 


(6.14) 


The  singular  values  and  vectors  have  many  useful  properties  which  they 
inherit  from  properties  of  the  eigenvalues  and  eigenvectors  of  Hermitian 
matrices.  For  example,  the  matrices  U  and  V  are  unitary:  UUH  =  I,  WH  =*  t. 
Thus,  the  sets  of  right  and  left  singular  vectors  form  an  orthonormal  basis 
for  Cn.  The  singular  values  possess  the  following  min-max  property 


[37-39] : 


n-i+1 


(6.15 


a  *  min  t*x  yj-—  dim  S 

uSS.  2 

u#0 

The  minimum  in  (6.15)  is  taken  over  all  subspaces  of  ®n  of  the  appropriate 
dimension  while  the  maximum  is  taken  over  all  nonzero  vectors  within  the 
given  subspace.  Thus,  in  particular, 


lLu?  2 

a[L]  -  max  j-t —  -  (6.16 

ufiC0  “2 
ui*0 


and 


a(L] 


min 

u6tn 

u^O 


(6.17 


If  det  L/0,  then 


a[L] 


(6.18 


Thus,  at  each  frequency, the  singular  value  decomposition  gives  the  directions 
of  the  inputs  to  the  system  which  are  multiplied  by  a  canonical  set  of  gains, 
the  values  of  those  gains,  and,  via  (6.10),  the  directions  of  the  resulting 
outputs.  Finally,  the  above  discussion  shows  that  letting  L  vary  as  a 
function  of  s  causes  the  singular  vectors  to  vary  in  accordance  with  the 
min-max  principle  (6.15). 
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6.5.  Phase  in  SISO  Systems 

The  value  of  a  scalar  transfer  function  is  given  at  a  specific 
frequency  by  its  gain  and  phase.  The  concept  of  gain  has  been  generalized  to 
matrix  transfer  functions  as  described  above;  however,  the  concept  of  phase 
has  presented  more  difficulty.  In  order  to  arrive  at  a  useful  notion  of 
phase  inanMIMO  system  it  is  instructive  to  examine  the  different  ways  in 
which  the  term  phase  is  used  in  SISO  systems  and  focus  on  those  which  are 
most  relevant  to  feedback  design.  To  do  this,  consider  the  way  in  which 
scalar  signals  interact  when  added  together. 

Suppose  one  is  given  two  nonzero  scalar  exponential  signals  with 
the  same  complex  frequency: 


ux(t)  =  c1eSot 

and  (6.19] 

u2Ct)  ■  c2eSot 

The  phase  difference  between  these  two  signals  may  be  uniquely 
defined  (modulo  2it)  as 

A0  *  *  c2  c^  .  (6. 20] 


The  importance  of  the  phase  difference  is  that  it  governs  how  the  signals 
interfere  when  added  together.  This  may  be  seen  from 


ju1(t)  +  u9(t) | 


e  ,|cl  +  c2  1 

ex°c  y I c1 [ 2  +  ]c2|-  +  2Re  c0c^ 

+  !  c2  *  2  +  2|c1|  |c2|  cos  20 


(6.21] 


2 


For  example,  if  the  phase  difference  is  (2k  +  l)ir,  k  an  integer,  then  the 
signals  interfere  purely  destructively  and 


|ul<t)  +  u2(t)|  -  eX°t||c1|  -  | c2 | 


(6.22) 


If  the  phase  difference  is  2kT,  then  the  signals  interfere  purely  construc¬ 
tively  and 

|ux(t)  +u2CO|  =  «*ot[|c1|  +  |c2|]  .  (6.23) 


If  the  phase  difference  is  not  an  integer  multiple  of  ir ,  then  the  magnitude 
of  the  sum  of  the  two  signals  is  bounded  below  and  above  by  (6.22)  and  (6.23). 
From  (6.3),  equations  analogous  to  (6.21)-(6.23)  also  hold  in  the  frequency 
domain ;  i . e . , 


u^ (s)  +  u2(s) I 


1 

s-s 

o 


(6.24) 


The  importance  of  the  relation  between  phase  difference  and  signal  interference 
in  feedback  systems  should  be  obvious  from  the  definition  of  e(s)  in  Figure  2.1 

Note  that  it  is  possible  to  compute  the  phase  difference 
between  u^t)  and  u2(t)  by  first  computing  the  phases  of  u^t)  and  u2(t) 
individually  and  then  subtracting.  First,  define 


This  definition  leads  to  another  way  of  representing  the  signals : 


u^t)  *  |  |  eS°t+^l 

(6.26) 

u2(t)  =  |c2|eS°t+62 

Thus,  the  phase  difference  between  two  signals  may  be  defined  in  two 
equivalent  ways  -  either  from  (6.20)  or  from 

A6  =  e2  -  0  .  (6.27) 

Note,  in  particular,  that  the  phase  difference  between  the  two  signals  is 
invariant  under  rotations  of  the  coordinate  system  in  the  complex  plane 
used  to  measure  the  phases  of  the  individual  signals;  the  phases  of  the 
individual  signals  are  not  invariant,  however. 


6.6.  Phase  in  MIMO  Systems 

For  vector  signals  no  standard  definition  of  phase  or  phase 
difference  exists.  The  possibility  of  such  definitions  is  studied 
mathematically  in  Chapter  10.  A  reasonable  approach  based  upon  physical 
considerations  seems  to  be  to  generalize  the  notion  of  phase  difference 
by  considering  the  way  in  which  two  vector  signals  .interfere  when  added 
together.  Thus,  consider  (6.19)  with  c^  and  c2<=in.  The  analogue  to  (6.21) 
is  (using  Euclidean  vector  norms) : 


£f 
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Equation  (6.28)  shows  that  the  interference  of  u^(t)  and  U2(t)  is  governed 
by  the  term  Re  c^c^.  Defining 

6  -  *  c2\  (6.29) 

and  recalling  the  definition  of  the  angle  between  one-dimensional  subspaces 
of  «n,  (6.28)  becomes 


lul(t)  +  u2(t)I 


+  2|  c^|  |  c2|cos<f>  cos0 


(6.30) 


where  <J>  is  the  angle  between  the  subspaces  spanned  by  u^  and  u^ .  Comparing 
(6.3)  with  (6.21)  reveals  that  the  interference  between  two  vector  signals 

7T 

depends  upon  the  angle  <p .  If  the  two  subspaces  are  orthogonal  (^-j).  then 

|u1(t)  +  u2(t)|  »  eXoC  / fcj2  +  |c2j2  .  (6.31) 

Thus,  in  this  case  the  signals  do  not  interfere  at  all.  If  <p  <  tt/2 ,  then 
interference  can  occur;  for  purely  constructive  interference 


ju^t)  +  u2(t)J  =  Jii1(t)|  +  |u2(t)|  (6.32) 

and  for  purely  destructive  interference 

1 ux(t)  +  u2(t)|  =  |!u1(t)|  -  |u2(t)| |  (6.33) 

it  must  be  that  the  two  signals  lie  in  the  same  direction  ($*0).  In 
addition,  9  *  2kir  is  required  for  (6.32)  and  9  *  (2k+l)iT  radians  are  required 
for  (6.33).  If  0  <  <j>  <  it/2,  then  interference  takes  place  between  one 
signal  and  that  component  of  the  other  signal  which  lies  in  the  same 


direction. 
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The  preceding  discussion  suggests  that  a  natural  generalization 
of  phase  difference  to  vector  signals  is  given  by  the  angle  0  which  determines 
how  the  signals  interfere.  If  the  signals  lie  in  orthogonal  directions,  then 
6  is  undefined.  This  is  only  natural  since  signals  in  orthogonal  directions 
cannot  interfere. 

Now  that  a  definition  of  phase  difference  between  two  vector  signals 
has  been  proposed,  does  there  exist  a  definition  of  the  phase  of  a  vector 
signal?  One  way  to  attempt  this  would  be  to  fix  a  vector  w£tn,  e.g.,  one 
of  the  standard  basis  vectors,  and  then  define  the  phase  of  a  vector  signal 
as  (compare  with  (6.25)) 


phase[u(t) ]  -  *  wHu(t) 


(6.34) 


This  definition  could  be  used  to  define  the  phase  of  all  vector  signals  in 
directions  not  orthogonal  to  that  of  w.  Computing  phase  difference  between 
two  signals  u^(t)  and  u^(t)  using  (6.34)  would  thus  be  identical  to  computing 
the  phase  difference  between  the  components  of  u^(t)  and  u2(t)  lying  in  the 
direction  of  w.  By  defining 


and 


2WHu1(t) 


t  -  0 


*vHu2(t) 


0 


a  measure  of  phase  difference  is  given  by 


(6.35) 


(6.36) 
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This  definition  will  not,  in  general,  yield  the  same  value  of  phase  difference 
as  that  given  by  (6.29)  and,  in  addition,  does  not  have  as  nice  a  physical 
interpretation.  Nonetheless,  this  definition  of  phase  difference  will  be 
useful  in  later  chapters. 

The  natural  definition  of  phase  difference  (6.29)  will  be  found 
useful  in  studying  properties  of  MIMO  feedback  systems.  The  relation 
between  (6.29)  and  matrix  transfer  functions  is  now  explained.  From  (6.5) 
it  can  be  seen  that  the  phase  difference  between  an  input  (6.1)  and  the 
resulting  output  of  the  system  whose  transfer  function  is  given  by  L(s)  is 
given  by 

0(sq)  -  *uH(t)y(t) 

*  *uH(t)L(sQ)u(t)  .  (6.37) 

The  angle  between  the  direction  of  the  input  and  that  of  the  resulting 
output  is  given  by 

|uH(t)y(t)| 

cos$(so)  =  II  u  ( t )  1  «L(so)u(t)« 

| uH(t)L(sQ)u(t)  | 

=  lu(t)«  lL(so)u(t)l  ‘  (6,38) 

Given  an  input  of  the  form  (6.1),  with  k  a  unit  vector,  the  quantity 
kH  L(sQ)k  =  |  L (sq) kj  cos>J>(so)e^  9 

thus  relates  the  gain  of  the  system  in  the  direction  K,  the  angle  between  the 
directions  of  the  input  and  the  resulting  output,  and  the  phase  difference 


between  the  input  and  output. 
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Given  a  matrix  MS  c  the  set 


{ u^Mu  :  Juj  •  1  ,  u£  Cn} 


(6.39) 


is  defined  to  be  the  numerical  range  [40]  of  the  matrix.  This  set  has  been 
identified  as  important  in  system  theory.  For  example,  a  necessary  condition 
for  (strict)  positive  realness  [41 ]  of  a  transfer  function  matrix  is  that 
the  numerical  range  of  L(s) ,  Re[s]>0,  be  confined  to  the  (open)  right  half 
plane.  Thus,  in  order  to  be  strict  positive  real,  a  transfer  function  matrix 
must  have  the  property  that  | 9 |  <  j  and  $ <  j  for  all  inputs  and  all  s  with 


Re[s]  >  0. 


In  the  next  section  two  parameterizations  of  matrix  transfer 


functions  will  be  developed.  These  parameterizations  will  use  angles  and 
phase  shifts  as  defined  in  this  chapter. 


I 


m 
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CHAPTER  7 

CHANGES  OF  BASIS  AND  PARAMETERIZATIONS 
OF  MATRIX  TRANSFER  FUNCTIONS 


7.1.  Introduction 

The  purpose  of  this  chapter  Is  to  introduce  a  change  of  basis  in 
the  spaces  of  inputs  to  and  outputs  from  a  system  described  by  a  transfer 
function  matrix.  As  discussed  in  Section  7.2  this  leads  to  transformed  open 
and  closed  loop  system  matrices  displaying  information  contained  in  the 
singular  value  decomposition  (SVD)  of  the  open  loop  transfer  function. 

In  particular  the  angles  between  singular  subspaces  and  the  phase  differences 
between  singular  vectors,  quantities  introduced  in  Chapter  6,  are  displayed. 

Using  these  parameters  lower  bounds  on  o[S(ja))]  are  derived. 
Implications  on  the  possible  values  of  these  parameters  given  that  an  upper 
bound  on  a[S]  is  satisfied  are  discussed  in  Section  7.3.  In  Section  7.4  some 
useful  parameterizations  for  transfer  functions  of  two-input  two-output 
systems  are  discussed. 


7.2.  A  Change  of  Basis 

Let  the  singular  value  decomposition  of  the  open  loop  transfer 
function  matrix,  L(s) S  cn  n,  be  given  by  (suppressing  dependence  on  s) : 

L  -  VIUH  (7.1) 

with  singular  values 

l  *  diag[c1  . . .  <?n]  (7 . 2) 


left  singular  vectors 


and  right  singular  vectors 


U  -  [xix  |  ...  |  un]  .  (7.4) 

In  contrast  to  convention  the  singular  values  are  not  necessarily  assumed  to 
be  numbered  in  decreasing  order  of  magnitude.  The  singular  values  and  vectors 
must  still  satisfy  the  equation, 


Lui  "  °ivi  *  (7.5) 

however.  Degrees  of  freedom  available  in  selecting  the  singular  vectors  are 
discussed  in  Appendix  B. 

It  will  be  convenient  to  write  the  spaces  of  inputs  and  outputs 
In  the  orthonormal  basis  for  Cn  furnished  by  the  right  singular  vectors. 

(The  left  singular  vectors  could  also  be  used  for  this,  of  course.) 

With  this  change  of  basis  the  matrix  L  is  transformed  into: 


L  -  U^I 

(7.6) 

Note 

L  =  ULU*1 

(7.7) 

and 

L  -  U^LU 

(7.8) 

The  sensitivity  and  complementary  sensitivity  functions  become 

S  =  [I  +  L]-1 
*  [I  + 


(7.9) 


[I  +  U^VE  ]_1  U^E 


and  are  related  to  S  and  T  via 

S  *  usuH 

and 

T  =  UTijn 


(7. 

(7. 


The  transformation  from  L,  S  and  T  to  L,  S  and  T  just  presented 
is  both  a  similarity  transformation,  preserving  the  eigenvalues  of  the 
matrices  involved,  and  a  unitary  transformation,  preserving  the  singular 
values.  The  former  fact  implies  that  absolute  stability  properties  of  the 
closed  loop  system  are  preserved  since  these  can  be  assessed  from  the 
determinant  [42]  or  the  eigenvalues  [43]  of  S(s)  and  T(s).  The  latter  fact 
implies  that  information  about  closed  loop  system  properties,  such  as 
stability  margins  and  disturbance  response,  is  preserved. 

Writing  L  in  terms  of  the  individual  singular  values  and  vectors 
(7 . 2)-(7 .4)  yields 

L  -  U^E 


H  H 

U1  Vlal  U1  V2a2 

H  H 

U2  Vlal  U2  V2°2 


(7.13) 


;1 

i 


iai 


H 

u  v  a 
n  n  n 
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This  expression  is  appealing  in  that  it  displays  the  angles  between  singular 
subspaces  given  by 

cos  -  ! u±Hv± !  (7.14) 

and  the  phase  difference  between  left  and  right  singular  vectors  defined 
by 

0i  "  4uiHvi  ‘  (7'15) 

(Physical  interpretations  of  and  0^  are  presented  in  Chapter  6.) 

Note  that  if  the  singular  values  are  distinct,  then  all  the  are 
uniquely  determined  and  0  <  <  tt/2.  If  t  ir/2  then  the  0^^  are  defined 

and  uniquely  determined  (modulo  2ir) .  If  there  exist  singular  values  with 
multiplicity  greater  than  one,  then  the  directions  of  the  corresponding 
singular  vectors  are  no  longer  uniquely  defined.  Given  a  choice  of  right 
singular  vectors,  however,  the  left  singular  vectors  are  uniquely  determined 
vis  (7.5).  Thus,  it  is  possible  to  define  and  9^  using  this  choice  of 
singular  vectors. 

The  transformations  (7.6),  (7.9)  and  (7.10)  are  useful  in  relating 
properties  of  the  open  loop  singular  values  and  subspaces  to  those  of  the 
closed  loop  system.  This  relation  will  be  studied  in  detail  in  Chapter  8 
for  the  case  when  there  exist  large  gains  in  some  directions  and  small  gains 
in  other  directions  over  the  same  frequency  range.  A  relation  which  holds 
for  the  general  case  is  presented  in  Section  7.3. 


7.3.  A  Lower  Bound  on  Closed  Loop  Sensitivity 

In  this  section,  a  lower  bound  on  o[S(ju))],  the  largest  singular 
value  of  the  system  sensitivity  function,  is  obtained  in  terms  of  the 
open  loop  singular  values  and  the  parameters  ^  and  9^  from  (7.14)  and  (7.15) 
First,  a  more  general  relation  is  obtained  using  the  numerical  range  of 
L(jui)  defined  in  Section  6.6. 

From  the  definition  of  the  sensitivity  function  (Chapter  5)  and 
various  properties  of  singular  values,  it  follows  that  (again  suppressing 
dependence  on  s) : 


ISI2  " 

-  o[(I  +  L)"1] 

1 

"  a[I  +  L] 

1  2  min  l  (I  +  L)wj „ 

w€®n 

1WS2  =  1 


(7.16 


From  (7.16)  it  follows  that,  given  any  unit  vector  w€(En,a  lower  bound  on  the 
magnitude  of  the  sensitivity  matrix  may  be  obtained  from 


:sfl2  s 


(I  +  L)wfl, 


(7.17 


The  lower  bound  (7.17)  can  be  written  as 


Now,  recall  w  Lw  is  an  element  of  the  numerical  range  of  L  (Section  6.6). 
Define 


IH2 

(7.19a) 

Lw 

P 

(7.19b) 

1  H  1 

arc  cos|w  z| 

(7.19c) 

*wHz 

(7.19d) 

Then,  (7.18)  becomes 

ISi2  >-J===  •  (7.20) 

/l+p  +  2p  cos<J>cos0 

i  0 

Now,  if  <{>  =  0,  i.e.,  if  w  is  an  eigenvector  of  L,  it  follows  that  peJ  is  an 

eigenvalue  of  L  and  the  relation  between  the  gain  p  and  phase  difference  0 

is  exactly  as  in  the  case  of  SISO  systems.  If  $  >  0,  however,  the  relation 

i9 

is  no  longer  as  in  the  SISO  case.  Apparently,  pe  ** -  1  can  occur  without 
necessarily  causing  a[S]  to  become  unbounded  or  even  large,  provided  that 
<)>  is  sufficiently  large.  Of  course,  inputs  in  directions  other  than  those 
given  by  w  could  produce  a  large  output  from  the  sensitivity  function. 
Nonetheless,  (7.20)  could  be  useful  in  showing  that  certain  open  loop  system 
properties  necessarily  lead  to  poor  feedback  properties.  Moreover,  (7.20) 
can  be  used  to  study  necessary  properties  of  systems  which  satisfy  bounds  on 
o[S],  e.g.,  optimal  linear  quadratic-state  feedback  regulators  designed  under 
certain  assumptions  (see,  e.g.,  [4,  p.  13]). 
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Note  that  (7.17)  can  be  evaluated  for  w,  a  right  singular  vector 
In  this  case  (7.20)  becomes 


/  2 

/I  +  a.  +  2o.  cosb.  cosQ. 


(7. 21] 


This  bound  is  useful  in  studying  the  relationship  between  the  canonical  set 
of  open  loop  gains  given  by  the  singular  values  and  feedback  properties. 

For  example,  if  lsl  2  <  1,  then  it  follows  that,  for  each  i, 

a.  +  2cosij!i  cosQ^  >  0  .  (7.22' 

One  possible  application  of  this  bound  would  be  to  study  properties  of 
linear  quadratic-state  feedback  regulators,  for  which  under  certain 
assumptions  (7.22)  is  guaranteed  to  hold.  Analysis  of  (7.22)  or  an  analogous 
bond  obtained  from  the  more  general  inequality  (7.20)  could  lead  to  new 
interpretations  of  the  guaranteed  properties  of  these  designs. 


i£ 


*  *  * 
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7.4.  Parameterizations  for  Two-Input  Two-Output  Systems 

In  this  section,  two  useful  parameterizations  of  transfer  function 
matrices  for  two-input  two-output  systems  are  described.  Such  parameteriza¬ 
tions  are  undoubtedly  also  possible  for  larger  systems.  The  emphasis  in  this 
thesis,  however,  will  be  to  ascertain  whether  useful  insights  are  obtained 
for  simple  cases.  If  useful  results  can  be  obtained  for  simple  cases,  then 
this  will  motivate  further  work  on  larger  systems. 


Suppose  W  is  a  unitary  matrix  in  £  and  suppose  further  that  the 
diagonal  elements  of  W  are  nonzero.  (It  can  easily  be  verified  that  =  0, 
if  and  only  if  w22“0*)  Then  W  can  be  parameterized  as 


W  = 


cosb  e- 


i0l 


.  -j’J'  1 

-sinb  e  e 


0o  “! 


1*  j 

sin^  eJ  eJ 


8- 


cos<j>  e- 


02 


(7.2 


where  b €  [0,W2)  and  the  0^  and  ip  can  be  chosen  to  lie  in  the  interval 
(-TTjTr].  (Note  these  parameters  are  not  the  same  as  those  of  (7.14)  —  (7.15) .) 
This  parameterization  may  be  used  in  the  SVD  (7.1) : 


L  =  VZUH 
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Note,  that  (7.24)  contains  10  parameters;  this  is  apparently  too  manv  since 
eight  numbers  suffice  to  specify  a  2*2  complex  matrix.  Inspection  reveals 
that  the  value  of  L  depends  only  on  the  quantities 


Thus  (7.24)  can  be  written 


vsu11 


cos$v  -sin$v  e 


I'J'v 

sin4>v  e  cos$v 


jel 

cr^  eJ  "L  0 


0  o„  e- 


i02 
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COS(Jl 


-sin<j>  e 
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-J*u 


I  H 


Jvu 


sin<j)  eJ  cos<j> 

Tu  u 


From  (6.35)-(6.36)  it  can  be  seen  that  0^  is  the  difference  in  phase  between 
the  components  of  and  lying  in  the  direction  of  the  standard  basis 

Thus,  0^  does  not  measure  how  an  input  in  the  direction 
u^  will  interfere  with  the  resulting  output  which  is  in  the  direction  v^. 
Rather  0^  determines  how  the  indicated  components  of  u^  and  v^  interfere. 
Similar  remarks  can  be  made  about  the  meaning  of  0^  using  the  basis  vector 


0 


vector  e^  = 


The  angles  $  and  <1^  indicate  how  an  input  written  in  the  standard 
basis  is  multiplied  by  the  singular  value  gains.  Similarly,  the  angles 
and  t|i  indicate  how  the  resulting  output  is  distributed  between  the  standard 
basis  directions.  Thus,  although  the  phases  9^  do  not  have  as  nice  an 
interpretation  as  those  given  by  4-u.  v. ,  the  corresponding  parameterization 


of  the  singular  subspaces  is  useful  for  relating  properties  of  the  SVD 
to  design  goals  stated  in  terms  of  the  physical  loops  of  the  system. 

Parameterizations  can  also  be  obtained  for  the  matrix  U®V  in 
(7.6),  the  expression  for  L.  Thus,  if  u^v^O,  U^V  may  be  parameterized 
as 


where  cos<f>  *  |  v^|  is  the  cosine  of  the  angle  between  each  set  of  left  and 

right  singular  subspaces;  <j>€  (0,tt/2)  .  Since  t>  <  */ 2,  the  phase  differences 

9i  =  *ui  Vi  are  de^ined  and  be  chosen  so  that  9^  (—it  , tt ] .  The  quantity 

is  not  well  defined,  however,  due  to  a  lack  of  uniqueness  in  the  choice  of 

singular  vectors  (Appendix  B) .  Thus  it  may  be  assumed  that  tp  =  0.  To  justify 

this,  recall  that  if  there  exist  well-defined  one-dimensional  right 

2 

singular  subspaces  11^  and  11^  £  C  .  The  right  singular  vectors  may  be  chosen 
as  any  unit  vectors  within  these  subspaces;  the  left  singular  vectors  are 


then  completely  determined  from  (7.5).  Suppose  that  for  a  particular  choice 
u^ll^  it  so  happens  that  ip/0;  i.e.,  ^u^v^  +  6  .  Then  choose  =  e'^u 

A  _  4  \p  U. 

(which  from  (7.5)  implies  v,  -  e  J  v,);it  follows  that  *u2  v,  = 


9,  and  'i>  = 


0. 


Finally,  (7.27)  can  be  used  in  (7.6)  to  obtain 

L  = 


(7.2 


This  transformation  of  L  displays  useful  information  concerning 

how  inputs  to  L  in  the  direction  of  the  right  singular  subspaces  interfere 

with  the  resulting  outputs  when  added  together. 

Although  there  are  only  five  parameters  displayed  in  (7.28)  eight 
parameters  are  still  needed  to  completely  specify  the  matrix.  The  reason 
is  that  three  parameters  are  needed  to  specify  the  basis  of  right  singular 


vectors.  Ordinarily  four  parameters  are  needed  for  this;  e.g., 

ej0ul  -sin*  ej*u  ej9u2  1 

u 


cos*  e 
u 


.in.  «.i'*V9ul 


cos*  e 
u 


It  will  be  recalled,  however,  that  9u^  was  adjusted  to  make  the  parameter  * 
in  (7.27)  equal  to  zero.  On  the  other  hand,  fixing  9^  *  9u  =0  implies  that 
*  is  well  determined.  Thus  in  the  former  case,  L  is  completely  determined 


{o,  ,  a„,  9.,  9-,  *,*,*,  9  } 

1  2  1  2  ’  u  u  U2 


while  in  the  latter  case  L  is  completely  determined  by 


The  five  parameters  in  (7.28)  appear  to  be  the  most  useful  in 
approximating  feedback  system  properties  in  terms  of  open  loop  system 
parameters.  However,  as  the  discussion  of  structured  design  specifications 
for  MIMO  systems  showed  (Chapter  5) ,  information  about  the  directions  of  the 
right  singular  subspaces  is  important  in  stating  design  goals.  Thus  both 
parameterizations  display  valuable  design  information. 


CHAPTER  8 


SYSTEMS  WITH  HIGH  AND  LOW  GAINS  AT  THE  SAME  FREQUENCY 

8.1.  Introduction 

The  purpose  of  this  chapter  is  to  provide  approximations  to  the 
sensitivity  and  complementary  sensitivity  functions  of  an  MIMO  feedback 
system  in  terms  of  properties  of  the  open  loop  transfer  function.  The  case 
considered  is  that  in  which,  over  some  frequency  range,  this  transfer  function 
has  much  higher  levels  of  gain  in  some  directions  than  in  others.  Motivation 
for  studying  this  case  was  provided  by  the  discussion  of  structured  distur¬ 
bance  inputs,  sensor  noise,  and  modelling  uncertainty  in  Chapter  5.  The 
assumption  of  such  structure  naturally  leads  to  systems  whose  open  loop 
transfer  function  has,  at  the  same  frequency,  high  levels  of  gain  in  directions 
for  which  good  performance  is  desired  and  small  gain  in  other  directions  to 
provide  stability  robustness  and/or  sensor  noise  rejection.  An  additional 
motivation  is  provided  by  the  observation  that  a  given  plant  may  naturally 
possess  different  bandwidths  in  different  directions.  To  facilitate  design  of 
a  compensator,  it  is  useful  to  know  how  this  property  affects  the  closed  loop 
system.  Finally,  when  the  plant  has  an  unequal  number  of  inputs  and  outputs, 
one  of  the  two  open  loop  transfer  functions,  L  =  PF  or  »  FP,  will 
necessarily  have  less  than  full  rank.  Hence,  such  a  system  will  also 
necessarily  have  different  bandwidths  in  different  directions . 

Portions  of  this  chapter  have  appeared  in  [29]  and  [44]  and  will 


appear  in  [45]. 
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8.2.  Approximations  to  S  and  T  In  Terms  of  Open  Loop  System  Properties 

It  is  desired  to  study  systems  which,  at  the  same  frequency,  possess 
much  higher  levels  of  gain  in  some  directions  than  in  others.  Thus  it  is 
convenient  to  partition  the  singular  value  decomposition  (SVD)  of  the  open 
loop  transfer  function  as  (suppressing  dependence  on  the  complex  frequency 
variable  s) : 


or 


L 


[Vi  !  V2] 


V1Z1U1H  +  V2Z2U2H 


(8.1) 


where  »  o[Z2]  and  with  U1<=Cn*k,  V2 ,U2 €  cnX (n_k) , 

and  E2€«^n  ^  k^ .  Inputs  to  the  open  loop  system  lying  in  the  subspace 

(the  column  space  of  U^)  are  amplified  by  the  level  of  gain  given  by  the 

singular  values  of  E^  and  appear  as  outputs  in  the  subspace  ^  (the  column 

space  of  .  This  partition  suggests  that  L  be  referred  to  as  consisting 

H  H 

of  a  high  gain  subsystem  )  and  a  low  gain  subsystem  (V2^2U2  ^  (with 

the  caveat  that  the  partition  need  not  correspond  to  physical  subsystems) . 

It  is  convenient  to  write  the  spaces  of  inputs  and  outputs  in  the 
basis  of  right  singular  vectors;  i.e.,  the  columns-of  U.  The  matrices  L  and 
S  written  with  respect  to  this  basis  are  given  by 


L  -  U^LU 
-U^E 


(8.2) 


Since  L  and  S  are  related  to  L  and  S  by  a  unitary  transformation,  the  singular 
values  of  L  and  §  are  the  same  as  those  of  L  and  S.  Note  that  (3.2)  and 
(8.3)  are  also  similarity  transformations;  thus  eigenvalues  are  also  preserved. 
This  fact  is  appealing  since  various  system  properties  quantified  by  singular 
values  and  eigenvalues  are  preserved  under  the  transformations. 

Rewriting  (8.2)  in  terms  of  the  partition  (8.1)  yields 


rciVi  uiV2 


°2Vl  °2\Z2 


(8.4) 


The  matrices  Uj.  v  give  a  measure  of  the  alignment  of  the  subspaces  LL  and 


Vj  (the  column  spaces  of  and  V ^ ) .  The  singular  values  of  U^*V.  are  the 


cosines  of  the  canonical  angles  between  the  subspaces  1L  and  (see 


Appendix  D  for  a  discussion  of  these  angles) .  The  canonical  angles 
generalize  the  usual  notion  of  angle  when  the  subspaces  are  one-dimensional. 
If  all  the  canonical  angles  are  equal  to  zero  then  11.  =0..  If  all  these 


angles  are  equal  to  ’t/2  then  11  ^  n  1)^  =  the  empty  set.  Similarly  the  sin¬ 


gular  values  of  ,  i# j  ,  are  the  sines  of  the  canonical  angles.  In  system 


H. 


terms  the  matrix  Vj  indicates  how  much  of  the  output  of  the  transfer  func¬ 


tion  L  lying  in  the  subspace  is  due  to  inputs  lying  in  the  subspace  LL.  If, 

H. 


at  some  frequency,  *  0,  i# j ,  then  the  high  gain  and  low  gain  subsystems 

are  decoupled  (at  the  given  frequencv) .  Note  that  in  terms  of  the  feedback 


system  of  Figure  5.1  this  means  that  the  output  of  the  high  gain  subsystem  is 
fed  completely  back  to  the  input  of  this  subsystem.  A  similar  statement  holds 
for  the  low  gain  subsystem.  If  f  0,  i^j  ,  then  part  of  the  output  of  the 

high  gain  subsystem  is  fed  back  to  the  input  of  the  low  gain  subsystem  (and 
conversely) .  The  effect  of  this  behavior  upon  the  sensitivity  and  complemen¬ 
tary  sensitivity  functions  will  now  be  examined. 

The  transformed  sensitivity  matrix  (8.3)  may  be  written  in  terms  of 
the  partition  (8.4)  of  L  into  subsystems  as 


I+ui\Ei  "1V2 


-1 


U2Vl 


t  +  vi\h 


(8.5) 


Using  a  standard  formula  for  the  inverse  of  a  partitioned  matrix,  the  trans¬ 
formed  sensitivity  function  can  be  written  as  (assuming  det  A#0  and 
det  C/0) : 


S  = 


A_1+  BC-1D 


-C-1D 


-BC 

-1 


-1 


where 


A  -  I  +  U1HV1I1 
B  =  (I  +  U^V^)"1 


c  -  (I  +  U2HV2E2)  -u2hv1i1ci  +  u1hv1e1)"1 
D  -  U^V^U  +  U^E^"1 


(8.6) 


(8.7a) 

(8.7b) 


(8.7c) 


(8 . 7d) 


If  the  gain  in  both  subsystems  is  sufficiently  greater  than  one, 
then  S 58  0  and  T**I.  If  the  gain  in  both  subsystems  is  sufficiently  small. 


then  S 88 1  ana  T^O.  These  approximations  were  discussed  in  Chapter  5.  The 
following  theorem  yields  an  approximation  to  S  and  T  when  the  gain  in  one 
subsystem  is  sufficiently  large  and  the  gain  in  the  other  subsystem  is 
sufficiently  small. 

In  the  following  it  will  be  assumed  that  none  of  the  canonical 
angles  is  equal  to  ir/2;  i.e.,  it  is  assumed  that  det[U^^V^]  t  0.  An 
analysis  of  this  special  case  can  be  performed  using  the  pseudoinverse  of 

u 

U±  V^;  however,  the  resulting  expressions  are  complicated.  In  Section  8. A 

the  analysis  of  this  case  is  performed  for  two-input  two-output  systems. 

H 

Theorem  8.1:  Assume  det[U^  V^]#0.  Then,  if 

al^]  »5[(U1HV1)“1]  (8.8) 

and 

a[Z2]  «  o[V2V,]  (8.9) 


the  sensitivity  function  and  complementary  sensitivity  function  are 

* 


approximated  by  the  following  equivalent  expressions : 


U2(V2HU2)'*1V2H 


S*  I  U2(V2Hu2)"1(V2Hu1K1H  +  U2U2H 


Vl  (VlHV  (v2HV  "1v2H  +  V2V2H 


(8.10a) 

(8.10b) 

(8.10c) 


A  caveat  is  in  order  concerning  approximations  (8 
in  the  remainder  of  this  chapter.  For  example,  (8 
is  singular,  rather,  just  that  a [S  -  U2(V2  u2)~^V2” 
small.  How  small  is  "small  enough"  depends  on  the 
he  magnitude  of  the  disturbances  to  be  rejected, 
which  the  gain  ifEjJ  must  exceed  5[(UiHV]_)~1]  and 
be  below  2[V2%25  order  to  insure  t8 .10)-(8.11) 
application.  Again,  similar  remarks  hold  for  the 
chapter . 


.10) -(8. 11)  as  well  as  those 
.10)  does  not  imply  that  S 
]  may  be  made  arbitrarily 
specific  application;  e.g., 
Similarly,  the  amounts  by 
by  which  the  gain  a[I2l  must 
hold  will  depend  upon  the 
other  approximations  in  this 


vi(ui  V  ui 


(8.11a) 


T  *  |  u2(u2hv1ku1hv1)"1u1h  +  u1u1h 

V!  (Ui\)  _1  )  V2H  +  V1V1H 


(8.11b) 

(8.11c) 


Proof :  See  Appendix  E.  ■ 

Expression  (8.10a)  shows  that  the  response  of  the  feedback  system 
to  disturbances  lying  in  the  subspace  1)^  (which  contains  the  output  of  the 
high  gain  subsystem)  can  be  made  arbitrarily  small.  Similarly,  the  distur¬ 
bance  response  is  confined  to  the  subspace  112  (which  contains  the  input  to 
the  low  gain  subsystem) .  The  magnitude  of  the  sensitivity  function  is 


o[S] 


cos  a 


(8.12) 


where  a  is  the  largest  canonical  angle  between  the  subspaces  112  and  02. 

Thus  in  order  that  the  system  response  to  disturbances  not  be  excessively 

large  it  is  necessary  that  the  high  gain  and  low  gain  subsystems  should  be 

sufficiently  decoupled.  Expression  (8.10b)  shows  that  the  disturbance 

response  of  the  system  consists  of  two  components.  First,  disturbances  in 

the  subspace  U2  are  fed  directly  through  to  the  output  of  the  system.  This 

is  analogous  to  the  SISO  case  for  which  | L  j  < <1  implies  y.^d.  Second, 

disturbances  in  the  subspace  are  amplified  by  the  system;  the  resulting 

output  lies  in  the  subspace  U2>  The  amount  of  amplification  is  given  by  the 

H  H 

singular  values  of  the  matrix  (V2  U2)  (\72  U^) ,  which  in  turn  are  equal  to 

the  tangents  of  the  canonical  angles  between  the  subspaces  U2  and  This 

behavior  (which  has  no  analogue  in  SISO  systems)  is  apparently  due  to  a 
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component  of  the  output  from  the  high  gain  subsystem  being  fed  into  the  input 
of  the  low  gain  subsystem  and  conversely.  Equation  (8.10c)  may  be  used  to 
decompose  the  disturbance  response  in  terms  of  the  outputs  of  the  high  and 
low  gain  subsystems. 

Remarks  similar  to  those  above  can  be  made  about  the  system 
response  to  sensor  noise.  This  may  be  accomplished  using  approximations 
(8.11)  to  the  complementary  sensitivity  function. 

Consider  now  the  problem  of  structured  sensor  noise  posed  in 

Section  5.4  and  the  resulting  design  specification  (5.38).  Suppose  the 

attempt  is  made  to  satisfy  this  specification  by  requiring  the  gain  of  the 

system  to  be  small  on  Ng,  the  subspace  of  the  input  containing  the  structured 

component  of  the  sensor  noise.  This  may  be  stated  by  requiring  Ll^N  and 

z  s 

a[I2]<<l.  Then  approximation  (8.11a)  shows  that  this  procedure  does  indeed 
prevent  ng,  the  structured  component  of  the  sensor  noise,  from  producing 
a  response  in  the  system  output.  Moreover,  maintaining  large  gain 
(a [ Z  ] >>1)  in  the  orthogonal  subspace  insures  that  the  disturbance 

"  I  S  X 

response  will  be  confined  to  the  subspace  II.,. 

In  order,  however,  to  prevent  the  system  from  amplifying  disturbances 
and  sensor  noise  in  some  directions  with  greater  than  unity  gain,  it  is  neces¬ 
sary  that  the  interaction  between  high  and  low  gain  subsystems  be  small. 

Thus  in  addition  to  the  requirements  o[I2]<<1,  U2 Ng  and  a [ ] >>1  it  is 
necessary  to  require  P2asU2.  Hence,  the  use  of  both  high  and  low  gains  in  a 
design  can  achieve  some  of  the  feedback  properties  expected  from  the  SISO 
case,  but  only  if  the  subsystems  are  approximately  decoupled. 

If  the  gain  in  both  subsystems  is  sufficiently  greater  than  one, 
then,  as  discussed  in  Section  5.3,  the  approximations  S^O  and  T55  I  hold. 


When  the  gain  in  one  subsystem  has  been  "rolled  off"  while  that  in  the 

other  is  still  large, then  Theorem  8.1  may  be  used  to  approximate  feedback 

properties.  Recall  the  condition  for  the  low  gain  subsystem  to  be  rolled  off 

is  o[I2]<<  condition  guarantees  that  the  matrix 

H  H 

V2  U2  +  ^2^^2  ^2*  Note  this  is  a  more  stringent  requirement  than  the  SISO 
analogue  which  required  that  | L |  < <1  in  order  to  guarantee  l  +  L*®!. 

This  discussion  motivates  the  investigation  of  whether  a  frequency 
region  analogous  to  gain  crossover  frequency  in  SISO  systems  exists.  Presum¬ 
ably  such  a  region  would  exist  for  crossover  of  each  of  the  low  and  high  gain 
subsystems.  This  is  now  investigated. 

Lemma  8.2:  Assume  det[U^V^]  +  0  and  det[V2^U2  +  ^21  ^  ® •  Then  if 

a[E1J»  (8.13) 

the  transformed  sensitivity  function  (8.6)  is  approximated  by 


§11  =*  z^1(u1\)"1ti- (u1Hv2)E2(v2Hu2  +  E2)“1(v2Hu1)] 

(8.14a) 

§12  ^-z^1(u1Hv1)"1(u1Hv2)E2(v2Hu2  +  e2)“1(v2hu2) 

(8.14b) 

S21  »  (V2HU2  +  Z2)"1(V2HU1) 

(8.14c) 

s22  *  CV2HH2  +  E2)-1<V2HU2) 

(8 . 14d) 

■ 

Proof:  See  Appendix  E. 

■ 

Theorem  8.3:  Assume  det[U^*V^]  t  0  and  detfV.,*\j„  +  Z„]  i  0. 

Then  if 

£ [ ]  is  sufficiently  large  the  sensitivity  function  and  complementary 
sensitivity  function  are  approximated  by  the  following  equivalent 


expressions: 
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U2CV2HlJ2  +  V"1V2H 

(8.15a) 

U2CV2HtJ2  +  E2)_1[CV2HlJl)UlH  +  (v2Hu2)U2H] 

(8.15b) 

tV2(V2HU2)  +  VL(V  ^2*  ^2^  2  +  Z2)"1v2H 

(8.15c)  a 

-  -i 

V1(U1HV1)"1U1H  +  U2(V2Hu2)~li:2(V2Hu2  +  E2)"1v2H 

(8.16a)  V 

+  t-V1(V1HU2)  +  V2Z2](V2Hu2  +  E2)_1v2H 

(8.16b) 

U1U1H  +  U2(V2Hu2  +  E2)"1[_(V2HU1)U1H  +  E2U2H] 

(8.16c) 

Proof :  See  Appendix  E.  ■ 

Lemma  8.2  shows  that  the  frequency  range  for  which  there  exists  i,  ^ 

H  — 

1  <  i  <  n,  such  that  a[I2]  -  ai^V2  ^2^  -  P^ays  a  r°le  analogous  to 

scalar  gain  crossover  frequency.  Near  gain  crossover  frequency  in  SISO  systems 
knowledge  of  open  loop  gain  is  insufficient  to  allow  the  approximation  of  ia* 

feedback  properties.  Similarly* at  frequencies  for  which  the  conditions 

a[I2J«  a[U2HV2J  (8.17) 

and 


o[Z2]»  afU^] 


(8.18) 


both  fail  to  hold,  knowledge  of  open  loop  singular  values  does  not  provide 
sufficient  information  to  determine  feedback  properties.  For  SISO  systems 
the  magnitude  of  the  sensitivity  and  complementary  sensitivity  functions  are 
determined  by  open  loop  phase.  Some  generalizations  of  this  were  presented 
in  Section  7.3.  Further  generalizations  are  given  later  in  this  chapter. 


,v. 

>  •  ■  i  •  ■  » '  o  » ■ 
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Note  that  the  magnitude  of  o[(v2**U2  +  ^  re^ate<^»  via 

(8.14a,b),  to  the  level  of  gain  required  in  the  high  gain  subsystem  to 
insure  that  the  component  of  the  disturbance  response  lying  in  the  subspace 
is  small. 

The  following  lemma  and  theorem  are  a  counterpart  to  Lemma  8.2 
and  Theorem  8.3  for  the  frequency  range  over  which  the  high  gain  subsystem 
is  being  rolled  off. 

Lemma  8.4:  Assume  det[U2**V2]  ^  then  if 

ct[E2] <<  a[(U2HV2)_1]  (8.19) 

the  transformed  sensitivity  matrix  (8.6)  is  approximated  by  (assuming  the 
indicated  inverse  exists)  : 


,  ^5 

11 

(8.20a) 

12  * 

-[i  +  u1hv1e1  -  (u^^)  (u2hv1e1)]'1(u1hv2e2) 

(8.20b) 

21 

-(u2\li)[I  +  UiVi  ’  (UlHv2E2)(U2HviEl)]"1 

(8.20c) 

as 

22 

i  +  (u2Hv]_r1)[i  +  u1Hv'1z1  -  (u1hv2:2)(u2hv1e1)]"1(u1hv2:2) 

(8 . 20d) 

■ 

Proof :  See  Appendix  E.  » 

Theorem  8.5:  Assume  detfU^*^]  #  0  and  det  (I  +  U^^V^Z^)  t  0.  Then  if 
j[~2]  is  sufficiently  small,  the  sensitivity  function  and  complementary 
sensitivity  function  are  approximated  by  the  following  equivalent  expressions 


(8.21b)  : 


(8.21c)  a 


(8.22a) 


(8.22b) 


(8.22c) 


'  ^2  +  Vl\>_1<1  + VW'V  (8.21a) 

S  85  u2U2H+  [Ui“u2(u2HV1)I1](I+U1HV1E1)"1U1H  (8.21b 

V2V2H  +  vi(i+IiUi\)"1[ViH-Z1(U1HV2)V2H]  (8.21c 

V^d*^1^^)'1^11  (8.22a) 

T  58  \  V1E1(I  +  U1HV1E1)-1[(U1HV1)V1H  +  (U^v/]  (8.22b) 

IU1(U1HV1)  +  ^(D^)]  1^1  + U^E^"1^11  (8.22c) 

■ 

Proof :  See  Appendix  E.  ■ 

Theorem  8.5  shows  that  the  frequency  range  for  which  there  exists 
i,  1  <  i  <  n,  such  that  ad^]  <  ^[(Ul^^)  <  a [E^],  plays  a  role  analogous 

to  scalar  gain  crossover  frequency  when  the  high  gain  subsystem  is  being 
rolled  off.  Comments  similar  to  those  following  Theorem  8.3  can  be  made 
concerning  system  properties  in  this  frequency  range. 

The  approximations  to  the  sensitivity  and  complementary  sensitivity 
functions  given  by  Theorem  8.1-8. 3  may  be  used  to  obtain  approximations  to  the 
singular  values  of  these  matrices.  Recall  that  and  0^  are  k-dimensional 
subspaces  of  ®n  and  U2  and  are  (n-k) -dimensional. 

From  (8.10a)  it  follows  that  k  singular  values  of  S  are  small,  while 


(n-k)  singular  values  are  approximated  by 


a,  [S]  *  - - - 

i  cos  a. 


i=l, . . .  ,n-k 


(8.23)  ^ 


•T  If 
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where  (c^  ;  i=l,...,n-k}  are  the  canonical  angles  between  the  subspaces 
U2  and  l?2*  Similarly,  from  (8.11a)  it  follows  that  (n-k)  singular  values 
of  T  are  small, while  k  singular  values  are  approximated  by 


where  (6i  ;  i=l,...,k}  are  the  canonical  angles  between  the  subspaces  11^ 
and  P^.  Assume  that  n-k  <  k.  Then  from  Appendix  D  it  follows  that  2k-n 
nonsmall  singular  values  of  T  are  approximately  one  while  the  remaining 
nonsmall  singular  values  are  approximately  equal  to  the  nonsmall  singular 
values  of  S.  A  similar  statement  holds  if  n-k  >  k  except  that  n-2k  singular 
values  of  S  are  approximately  one. 

From  (8.15a)  it  follows  that  the  (n-k)  nonsmall  singular  values  of 
S  are  approximated  by 


oi[S]  »  ailCv2HU2  +  £2)~1]  1"1»* 


(8.25) 


The  approximation  of  the  singular  values  of  T  is  more  difficult  since  the 
approximations  (8.16)  are  not  in  the  form  of  a  singular  value  decomposition 
as  are  those  in  (8.15).  However,  some  useful  information  may  still  be 
obtained.  For  example,  let  the  singular  values  of  S  and  T  be  numbered  in 
order  of  decreasing  magnitude.  Then,  from  the  identity  T+S  *  I  and  Theorem 
6.6  of  Stewart  [37],  it  follows  that 

|ff±[S]  -  ai[T]|  <  1  .  (8.26) 

In  particular,  when  o^[S]  is  large  so  is  o^[T]. 

One  alternate  approximation  to  the  singular  values  of  T  may  be 
obtained  from  (8.16b).  Since  the  nonzero  singular  values  of  are  eciual 

to  one  it  follows  that  a^[T]  is  large  if  and  only  if 


I 


ai{["Vl(VlHlJ2)  +  V2Z2]  (V2Hu2  +  Z2)"]’V2H} 


is  large.  In  this  case 


oi[T]  *  /a^tCVi^KVz^  +  Z2)_lj  +  ai[Z2(V2Hu2  +  Z2)'1]  *  <8‘27) 

This  approximation  is  useful  when  it  is  desired  to  detect  large  values  of 
a^[T],  since  the  approximation  is  valid  exactly  in  this  case.  Similar 
approximations  to  c^fT]  and  a^S]  may  be  obtained  from  Theorem  8.5.  The 
approximation  analogous  to  (8.25)  is  given  by 

o±[T)  *  ai[Z1(I  +  U1HV1Z1)“1]  .  (8.28) 

Note  that  a^S]  is  large  if  and  only  if  o^  [U^  -  U^l^11^)^]  (I  +  Uj^Vj^)  } 

is  large.  In  this  case 

ai[S]  *  ^[(I  +  U^IjT1]  +  ^[(U^)  ^(I  +  U^V^jT1]  .  (8.29) 


3.3.  Two-Input  Two-Output  Systems 

The  purpose  of  this  section  is  to  specialize  the  results  of 

Section  8.2  to  systems  with  two  inputs  and  two  outputs.  In  order  to 
accomplish  this  a  parameterization  of  such  matrices  introduced  in  Section  7.4 
will  prove  useful. 


Consider  the  transformed  open  loop  transfer  function  (8.2): 


the  magnitudes  of  the  sensitivity  and  complementary  sensitivity  functions 
are  approximated  by 


o[S]  * 


1 

COS* 


(8.3 


o[T]  ** 


1 

cos  4) 


(8.3 


Proof :  Follows  easily  from  (8.10a),  (8.11a)  and  the  fact  that 

lUj^  VjJ  =>  |u2  v2|  -  cos*.  ■ 

Corollary  8.7;  Assume  cost))  +  0  and  cos*  +  a2  eJ  2  ^  q.  Then  if  is 
sufficiently  large  the  magnitude  of  the  sensitivity  function  is  approximated 
by 

o[ S]  * - - - rr~  •  (8.3 

| cos*  +  o2  2 |  m 


Proof:  Follows  from  (8.15a)  by  noting 


+  z2)_1v2h] 


lv2  u2  +  J2> 


and  from 

!v2Hu2  +  J2|  *  (cos*  e”-’"2  +  a ^  I 

=  |  cos*  +  a2  e^2|  .  i 

Corollary  8.8:  Assume  that  cos*  t  0  and  |l  +  cos*  ^  0.  Then  if 

o2  is  sufficiently  small  the  magnitude  of  the  complementary  sensitivity 
function  is  approximated  by 


j  1  +  cos<$>  eJ  1 1 


Proof:  Follows  from  (8.22a)  by  an  argument  similar  to  the  proof  of 

Corollary  8.7.  " 

From  Corollary  8.7  it  may  be  seen  that  at  frequencies  for  which 
a  cos<f>  it  is  important  to  keep  0£  bounded  away  from  ±ir .  Thus  such 
frequencies  bear  the  same  relation  to  the  low  gain  subsystem  as  gain  crossover 
frequency  does  to  anSISO  system.  Similarly  from  Corollary  8.8  it  follows  that 
at  frequencies  for  which  it  is  important  to  maintain  0^  bounded 

away  from  ±tt.  Thus  for  each  subsystem  there  exists  a  generalization  of  the 
concept  of  crossover  frequency;  moreover;  at  such  frequencies  the  value  of  a 
generalized  measure  of  phase  difference  must  be  kept  bounded  away  from  ±ir  in 
order  to  maintain  good  feedback  properties. 

The  results  of  Section  8.2  and  the  Corollaries  of  this  section  are 
valid  only  if  the  canonical  angles  between  subspaces  11^  and  (equivalently 
II2  an<*  ^2^  are  ^ess  c^an  2"  An  analysis  of  feedback  properties  may  also  be 
performed  in  this  case;  however,  the  expressions  are  rather  complicated.  For 
systems  with  two  inputs  and  two  outputs,  however,  the  expressions  are  simple 

enough  to  be  insightful.  An  analogue  to  Theorem  8.1  is  now  presented.  Note 

H  H 

that  the  condition  detfl^  V£]  #  0  imposed  in  that  theorem  reduces  to  V2/O. 

JJ 

Theorem  8.9:  Assume  ^2  v^  ■  0.  Then  there  exist  'two  cases: 

(a)  If  a.a,  >>  1,  then  the  sensitivity  function  is  approximated 


(b)  If  o^o 2  <<  1,  then 


_  ^  T  /  H  .  H  ,  H  .  H 
S  »  I  -  (0^2  vi)u2ui  “  (°2ui  v2^ulu2 


Proof :  Follows  easily  from  noting  that  u2  v2  =  0  implies 


-ul  v2c2 


H 

u2  vlal 


(8.40 


(8.41 


(1  +  a1o2) 


The  preceding  theorem  shows  that  if  cos$  =  0  and  >>  1  >>  a2 

then  a[S]  will  necessarily  be  large.  In  case  (a),  o[S]  *  — ,  and  in  case 

°2 

_  h 

(b) ,  a[S]  38  a^.  (This  follows  since  |u^  |  *  1,  i^j.)  In  either  case 

feedback  properties  are  poor.  This  is  plausible  since,  as  shown  in 

Collary  8.6,  a[S]  becomes  larger  as  the  angle  $  becomes  smaller.  Finally, 

it  might  be  expected  that  some  sort  of  unusual  feedback  behavior  would  occur 

when  cos$  *  0.  This  is  plausible  since  (i)  if  this  condition  holds  the 

outputs  of  each  subsystem  are  being  fed  completely  into  the  inputs  of  the 

other  subsystem  and  (ii)  the  two  subsystems  have  very  different  properties 

in  terms  of  levels  of  gain.  Thus,  as  a  signal  travels  around  the  loop  it  is 

multiplied,  alternately,  by  high  and  low  levels  of  gain. 

For  later  reference  the  approximations  to  a[T]  and  a[S]  given  by 

(3.27)  and  (8.29)  will  now  be  stated  in  terms  of  the  parameterization  (8.31). 

From  (8.27)  it  follows  that 


/(sin<t>)^  +  o2 

I  JS' 

|cos<i  +  o2  eJ  1 


(3.42 


In  this  section  an  example  of  a  transfer  function  describing  a 
two-input  two-output  system  will  be  given.  The  immediate  purpose  of  this 
example  is  to  illustrate  the  approximations  developed  in  Sections  8.2  and  8.3. 
A  secondary  purpose  is  to  provide  motivation.  It  will  be  seen  that  this 
example  possesses  feedback  properties  having  no  analogue  in  scalar  systems. 
Developing  the  theoretical  framework  necessary  to  understand  this  behavior 
is  the  primary  goal  of  this  dissertation. 

Consider  the  open  loop  transfer  function 


1.5  x  1(T 
(s  +  10) 


(s  +  .01)' 


-7.071  x  10  Cs  +  .1) 
(s  +  10)3 


(s  +  .01)  J 


The  gain  in  the  second  loop  of  this  system,  given  by  - 

I 

is  large  until  oi  ■  .01  rad/sec,  and  then  decreases  with  a  two-pole 


(s  +  .01)' 


roll-off  ( 88  40  db/decade) .  The  gain  in  the  first  loop  is  large  until  a)  =  .01 
rad/sec,  and  then  decreases  at  a  rate  of  20  db/decade.  The  effect  on  the  open 
loop  singular  values  is  shown  in  Figure  8.1.  The  largest  singular  value 


corresponds  to  the  gain  in  the  first  loop  while  the  smallest  singular  value 


-7071  x  10°  x  (s  +  0.1)  0.64 


(s+10)3 


(s +0.01)2 


10 


i-2 


1  102 
Frequency  (rad /sec) 
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Figure  8.1.  Singular  values  and  magnitudes  of  eigenvalues 
open  loop  system. 


corresponds  to  the  gain  in  the  second  loop.  In  the  frequency  range  over  which 
>>  a ^  the  right  singular  subspaces  are  aligned  with  the  standard  basis 
directions  (Figure  8.2).  This  fact  greatly  facilitates  analysis  of  this 
example.  First  the  angle  $  ,  which  is  a  measure  of  the  interaction  between  the 
two  physical  loops  of  the  system,  may  be  identified  with  the  angle  <f>  between 
the  left  and  right  singular  subspaces  (Figures  8.3  and  8.4).  (Recall  <f>  can 
be  interpreted  as  a  measure  of  interaction  between  high  gain  and  low  gain 
subsystems.)  A  second  consequence  is  that  the  two  measures  of  phase  difference 
given  by  ^  (7.15)  and  0^  (7.25)  are  very  nearly  equal  in  this  example 
(Figures  8.5  and  8.6). 

The  singular  values  of  the  sensitivity  function  are  plotted  in 
Figure  8.7.  (The  approximation  (8.37)  is  indistinguishable  from 
a[S]  »  a^fS]  on  the  scale  of  this  plot.)  Note  the  two  peaks  of  88  8  db  each 
in  o[S].  From  Figure  8.4  it  may  be  seen  that  the  second  of  these  corresponds 
to  a  large  value  of  $,  as  expected  from  Corollary  8.6.  The  first  peak  is  in 
the  crossover  frequency  range  in  which  cos$;  the  value  of  this  peak  may 

be  approximated  from  Corollary  8.7. 

This  example  illustrates  some  interesting  phenomena  not  present  in 
i9o 

SISO  systems.  If  a ^  eJ  L  were  a  stable  minimum  phase  transfer  function  with 

a  two-pole  roll-off  in  the  vicinity  of  gain  crossover  (e.g.,  - - - ~)» then 

(s  +  .01) 

the  value  of  3^  would  be  58  -179°.  This  follows  from  the  Bode  gain-phase 
relation  (2.24).  From  the  approximation  (8.37)  it  would  follow  that 
j[S]  55  34  db.  Instead  a[S]  ^  8  db.  The  immediate  reason  for  this  may  be 
seen  from  Figure  8.5.  As  the  gain  in  the  second  loop  of  this  system  begins 
to  roll-off,  the  phase  difference  02  becomes  increasingly  negative.  This 
might  be  expected  based  on  intuition  from  SISO  systems.  Upon  reaching  55  -160°, 


Frequency  (rad/sec) 


Phase  differences  between  "ith"  components  of 
pair  of  singular  vectors  -  open  loop  system. 
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however,  02  actually  Increases  to  **  -140°  in  the  vicinity  of  crossover.  In 
this  respect  it  is  as  though  a  phase  lead  filter  were  present  in  the  second 
loop  of  the  system.  On  the  other  hand,  if  such  a  filter  were  present  in  an 
SISO  system  the  40  db/decade  decrease  in  gain  could  not  be  maintained. 

It  is  interesting  to  note  that  a  phase  lead  filter  is  present  in 
the  first  loop  of  the  system  and  that  the  leading  action  occurs  in  the 
frequency  range  over  which  the  unusual  increase  in  02  occurs.  It  is  difficult 
to  see  a  direct  relation  between  these  two  phenomena.  One  can  easily  estab¬ 
lish,  though,  that  the  increase  in  the  gain  associated  with  the  lead 

filter  is  responsible  for  the  increased  interaction  between  the  two  loops  of 
the  system  and  thus  for  the  increase  in  $.  It  is  as  though  the  phase  lead 
present  in  the  first  loop  of  the  system  were  being  manifest  in  the  second  loop 
via  some  sort  of  helpful  loop  interaction.  Additional  evidence  for  this  was 
provided  by  the  observation  of  a  tradeoff  between  the  magnitudes  of  the  two 
peaks  in  a  [S  ] .  Increasing  the  ratio  !L21^Lill  by  a  constant  caused  the 
angle  <p  to  become  larger,  the  second  peak  to  increase  in  magnitude,  and  the 
first  peak  to  decrease.  Decreasing  this  ratio  produced  the  opposite  effect. 

The  measure  of  interaction  <p  became  smaller,  the  phase  ©2  became  more  negative, 
and  the  second  peak  decreased  at  the  expense  of  an  increase  in  the  first  peak. 

This  example  illustrates  phenomena  such  as  helpful  loop  interactions 
which  have  no  analogue  in  SISO  systems.  Moreover,  these  interactions  appear 
to  yield  additional  tradeoffs  among  feedback  properties  in  different 
frequency  ranges.  An  extended  analysis  of  this  example  will  be  performed  in 
a  later  chapter,  after  the  development  of  MIMO  generalizations  of  the  Bode 


gain-phase  relations. 


8.5.  A  Practical  Example 

The  purpose  of  this  section  is  to  analyze  an  example  of  a  practical 
feedback  system  using  the  results  of  Sections  8.2  and  8.3.  One  reason  for 
doing  so  is  to  verify  that  the  approximations  of  these  sections  are  valid  for 
systems  of  physical  (as  opposed  to  academic)  interest.  Another  reason  is  to 
illustrate  how  the  approximations  might  be  used  in  design. 

The  example  considered  is  described  in  [46].  It  consists  of  a 
laser  beam  whose  direction  is  controlled  by  two  mirrors.  Since  the  transfer 
functions  given  in  [46]  did  not  yield  the  plots  contained  therein,  the 
transfer  functions  were  modified  so  that  plots  similar  to  the  ones  given  were 
obtained. 

The  open  loop  transfer  function  for  this  system  is 


L(s) 


o 

1 

0 

“g  o  • 

(2600)  (2tt  j 

a 

2 

s 

(-1.825s  +  19. 5w) 

2.52 

(s  +  300tt) 

0  G 

yj 

(8.44a) 


^  r» \  /S  +  1068^  ,s  +  188. 5v 

Gc  (172'8)  (s  +  7480. V  (  s  ' 


(8.44b) 


r  =  (\  78?’>  (s  ^-$6 . 2  .  /s  +  62 . 8 . 
Gy  C1'782)  (s  +  1163. 7}  (  s  ) 


(8.44c) 


-i 


Bode  plots  of  the  elements  of  L(s)  are  found  in  Figures  8.8-8.10. 
Note  the  differences  in  the  bandwidth  of  the  two  loops.  Gain  crossover  for 
L.^  is  2300  rad/sec,  gain  crossover  for  L^2  85  410  rad/sec,  and  gain 
crossover  for  L„,  is  *  4000  rad/sec.  These  plots  show  the  existence  of  a 
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Figure  8. 8. a.  Gain  of  L(l,l). 
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Figure  8. 9. a.  Gain  of  L(2,l). 
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Figure  8.9.b.  Phase  of  L(2,l) 


Figure  8. 10. a.  Gain  of  L(2,2). 
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frequency  range  over  which  the  gain  in  the  first  loop  of  the  system  is  large 


while  that  in  the  second  loop  has  been  rolled  off.  Moreover,  the  large 
element  implies  the  two  loops  are  rather  strongly  coupled  over  this  frequency 
range.  The  singular- values  of  L(s)  are  plotted  in  Figure  8.11.  Note  that  the 
difference  in  bandwidth  between  the  two  loops  of  the  system  corresponds  to  a 
difference  in  the  bandwidth  of  the  singular  values.  Thus  it  should  be  possible 
to  apply  the  approximations  of  Section  8.3  to  estimate  feedback  properties. 

First,  note  that  the  angle  0  between  singular  subspaces  is  large 
(Figure  8.12)  and  is  related  to  the  coupling  between  loops  introduced  by  the 
element.  To  see  this,  observe  that  the  right  singular  subspaces  are 
aligned  with  the  standard  basis  directions.  This  follows  from  the  small  value 
of  <tu  (from  the  parameterization  7.26)  in  Figure  8.13.  Since  inputs  to  the 
individual  loops  of  the  system  are  thus  aligned  closely  with  inputs  in  the 
right  singular  subspaces,  it  follows  that  the  left  singular  vectors  may  be 
approximated  from 


a1  vx  =  I,u1 
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(8.46) 


Since  cosb  =*  1  and  |l.« j << |L-. | ,  it  follows  that 
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Figure  8.12.  Angle  between  singular  subspaces 
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Thus 


(8.48) 


and 


♦v  -  4  Si  -  4  hi  •  «•«> 

This  may  be  verified  from  Figures  8.8,  8.9  and  8.14. 

Comparing  the  relative  magnitudes  of  and  L21  (Figures  8.8  and 

8.9)  it  can  be  seen  that  $v  is  at  least  45°.  (Note  that  a  similar  procedure 

cannot  easily  be  applied  to  estimate  cos$v  using  the  smaller  singular  value. 

This  is  because  the  approximation  analogous  to  (8.46)  =■*•  (8.47)  must  deal 

with  products  of  large  and  small  numbers.  Such  products  cannot  easily  be  esti 

H  H 

mated.)  Next,  since  cos<p  =  \u±  vj  *  |e1  v±\9  it  follows  that  Thus  the 

identification  of  the  right  singular  subspaces  with  the  physical  loops  of 

the  system  implies  that  the  coupling  between  these  loops  produces  coupling 

between  the  high  gain  and  low  gain  subsystems  defined  in  (8.1).  Since 

Corollary  (8.6)  shows  that  a[S]  =»  — it  follows  that  the  large  level  of 

cos4> 

coupling  between  loops  will  cause  poor  sensitivity  properties  in  the 
appropriate  frequency  range.  (This  approximation  may  be  verified  from 
Figures  8.12  and  8.15.)  In  fact, there  exists  a  tradeoff  between  coupling 
and  poor  sensitivity.  This  was  pointed  out  in  [46];  however,  the  analysis 
was  made  entirely  a  posteriori  by  direct  evaluation  of  the  singular  value 
decomposition  of  the  sensitivity  matrix.  Note  the  SVD  of  the  sensitivity 


matrix  is  available  from  (3.17): 


(8.50) 
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Since  Che  right  singular  subspaces  approximate  the  standard  basis  directions, 
it  follows  that  the  measures  of  phase  difference  6^  and  6^  (defined  in 
(7.15)  and  (7.25))  are  approximately  equal.  This  may  be  verified  by 
comparing  Figures  8.16  and  8.17.  Thus  (3.50)  is  in  turn  approximated  by 


S  *  S 


app 


[-sin$v  e 


4 'J'v 


,  e-j02 
cos$  ]  (— — — ) 
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(8.51) 


Thus,  in  the  frequency  range  where  a[S]  ,  it  follows  that  the  rank  one 

cost? 

destabilizing  perturbation  of  smallest  norm  [46],  is  given  by 


E  » 


-  (cos$  e^92) 


0  -sin*  e‘^ 
v 

_0  cos*v 


(8.52) 


The  above  discussion  shows  that  much  valuable  information  about 
feedback  properties  may  be  obtained  from  analysis  of  the  open  loop  transfer 
function  in  the  frequency  range  over  which  the  gain  in  the  first  loop  is 
large  and  that  in  the  second  has  been  rolled  off.  In  Section  8.3  approxima¬ 
tions  to  feedback  properties  were  presented  for  the  frequency  ranges  in  which 
the  two  open  loop  singular  values  were  being  rolled  off.  The  approximations 
to  o[S]  given  by  (8.37)  and  (8.43)  are  plotted  in  Figure  8.18.  (Actually  the 
inverses  of  these  are  plotted  to  be  consistent  with  [46].)  The  approximations 
are  valid  to  within  a  few  dB;  certainly  they  are  much  closer  than  those 
obtained  from  using  single-loop  analysis  (Figure  8.19).  Note  that  since  L 
is  triangular,  the  same  results  are  obtained  using  the  eigenvalues  of  L  to 
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Figure  8.18.  a[I  +  L]  and  two  approxinations 


estimate  feedback  properties.  The  gains  of  the  eigenvalues  are  plotted  in 
Figure  8.11  while  the  phases  are  indistinguishable  from  the  parameters  9^ 
plotted  in  Figure  8.16.  Incidentally,  this  correspondence,  together  with  the 
fact  that  the  open  loop  singular  values  roll  off  at  approximately  the 
same  rate  as  the  eigenvalues  (Figure  8. 11),  shows  that  the  Bode  gain  phase 

1  Q 

relations  are  approximately  satisfied  by  the  functions  eJ  *.  As 
demonstrated  by  the  example  in  Section  8.4,  there  is  no  a  priori  reason  why 
this  should  be  true,  especially  since  there  is  a  significant  amount  of 
coupling  present  in  the  system. 

The  preceding  discussion  has  shown  that  much  valuable  information 
about  feedback  properties  may  be  obtained  from  analysis  of  the  open  loop 
transfer  function.  This  fact  has  several  advantages.  First,  these  feedback 
properties  may  be  determined  a  priori  before  a  design  is  completed,  rather 
than  a  posteriori  as  in  [46].  This  could  reduce  the  number  of  iterations 
needed  to  achieve  a  satisfactory  design.  Second,  the  existence  of  a  tradeoff 
between  the  level  of  coupling  and  good  feedback  properties  can  be  seen  and 
quantified.  These  two  facts  together  serve  to  point  out  the  usefulness  of 
this  method  of  analysis,  since  the  tradeoff  between  desirable  system  properties 
may  be  made  in  part  at  an  early  stage  in  the  design  process.  Thus  the  results 
of  Sections  8.2  and  8.3  yield  rules  of  thumb  concerning  the  effect  of  coupling 
high  and  low  gain  subsystems .  These  rules  may  be  used  to  gain  a  feel  for  the 
set  of  possible  designs  which  is  more  efficient  than  resorting  to  trial  and 
error.  This  characteristic  is  very  much  in  the  spirit  of  classical  methods. 


V‘> 


Finally,  as  discussed  in  [46],  the  reason  that  coupling  was  introduced  into 
this  design  was  to  improve  some  open  loop  system  properties  (reducing  torque 
requirements).  The  results  of  this  chapter  show  how  such  open  loop  phenomena 
may  be  related  to  feedback  properties. 


8.6.  Systems  with  Three  Inputs  and  Three  Outputs 

The  purpose  of  this  section  is  to  present  some  results  for  relating 
open  to  closed  loop  properties  of  systems  with  three  inputs  and  three  outputs. 
It  will  be  seen  that  there  exist  both  similarities  to  and  differences  from  the 
two  input  two  output  cases. 

Consider  a  system  with  three  inputs  and  three  outputs  and  suppose 
the  open  loop  transfer  function  has  much  larger  gain  in  one  direction  than  in 
others.  Then,  as  in  (8.1),  L(s)  may  be  written 

L  -  V]0!*1  +  V2Z2U2H  *  (8.53) 

Since  the  subspaces  11^  and  9^  are  one-dimensional,  the  matrices  in  (8.53)  may 
be  written  =»  u^  Vl  -  Vj.,  -  c^,  U2  -  [u2  j  u.j] ,  V2  =  [v2  j  v3]  and 
Z2  »  diag[o2>CT3].  Suppose  that  the  singular  values  are  numbered  so  that 
°1  >:>  °2  -  °3*  ^hen,  assuming  the  conditions  in  Theorem  8.3  are  satisfied, 
it  follows  that  the  sensitivity  function  may  be  approximated  by 

s  21  S„„„  4  +  t,)'V  .  (8.54) 


In  general  it  may  be  difficult  to  obtain  simpler  expressions  for 


a[S  ].  Note,  however,  that  lower  bounds  on  a[S  ]  may  be  obtained  from 
app  app 

upper  bounds  on  ct[S  Vl.  If  any  such  lower  bound  is  large,  then  the  system 
-  app 

has  poor  feedback  properties.  Thus,  write 
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From  (8.62)  it  follows  that  an  upper  bound  on  a [S  ^  ]  is  given  by  the  norm 

-  app 

of  each  row  and  of  each  column  of  S  ^  .  Define 

app 
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Then  (8.61)  and  (8.62)  imply 


a[S  ]  >  max  / 
app  -  ' 


l/t/jcos <t>2  +  a 2  e-’02|2  +  |v2Hu3|2 


1/>/|cos4>2  +  e^02|2  +  I  v3Hu2  I  2 

l/t^cos^-  +  c r-  e^02|2  +  jv  Hu_. 


3  ■  3  I  2  3 

1/J  cos$3  +  6 3  j  2  +  |v^Hu?|2 


3  2 1 


(8.) 


It  is  also  possible  to  obtain  upper  and  lower  bounds  on  a[Sapp] 


using  the  relation 


-IS  II  <  o[S  ]  <  ||S  I 
«  app il  p  -  app  -  H  app  II 


(8.65) 


The  bounds  (8.64)-(8.67)  may  be  used  to  relate  feedback  properties 

to  those  of  the  open  loop  system,  although  the  results  are  not  as  conclusive 

as  those  for  two-input  two-output  systems  in  Section  8.3.  Nonetheless,  one 

can  point  out  some  differences  between  systems  with  two  inputs  and  outputs 

and  those  with  three  inputs  and  outputs.  For  example,  the  quantities 
i  6 

!  cosd)i  +  on  eJ"i]  can  be  small  without  necessarily  causing  poor  feedback 

1  0 

properties  to  appear.  Suppose  |cos$3  +  eJ  3|  ^  o.  From  (8.64)  and 

i  -  H  i 

(8.67)  it  follows  that  o[S  ]  will  be  large  unless  both  |v«  u- |  and 

&PP  J  4 

?  ^  H 

^  u^l  are  bounded  sufficiently  away  from  zero.  Note  this  implies  two 
conditions.  First,  in  order  that  |v2  u^ |  and  |v3  u2  |  be  as  large  as  possible, 
outputs  of  L(s)  due  to  inputs  in  the  subspace  U2  =*  {column  space  of  U2  * 

[u2  |  u3]}  should  lie  primarily  in  U9  as  well.  Second,  there  must  exist 
coupling  within  the  low  gain  subsystem;  i.e.,  part  of  the  output  of  L(s)  due 
to  an  input  in  the  direction  spanned  by  u2  (respectively,  u^)  must  be  fed 
back  into  the  direction  spanned  by  (respectively,  u2). 
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Note  also  that  if  | cos^^  +  ffj  eJ  3j  0  and  if  a 2  >>  1  then  it 
follows  from  (8.65)  and  (8.67)  that  °tSapp]  >>  1  independently  of  the 
magnitudes  of  jv^^l  a°d  This  makes  sense  because  in  this  case 

outputs  from  the  low  gain  subsystem  represented  by  a^v^u^  can  only  be  fed 

jj 

into  a  subsystem  c2v2u2  wit^  much  higher  levels  of  gain. 

Suppose  that  there  exists  a  range  of  frequencies  over  which 
0^>>a2>>02*  The  MIMO  generalizations  of  crossover  frequency  discussed  in 
Section  8.2  may  be  applied  to  and  Does  there  exist  an  analogue  of 

crossover  frequency  for  02?  To  study  this,  note  that  the  conditions  a]_>>a2>>a3 
and  a^cclv^u^l  imply 
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If  a.  >>  |v0  u- I ,  then  a[S  ]  ** - — -  as  shown  in  Theorem  8.1.  To  obtain 

2  '2  2'  app  cos<f>3 

a  generalization  of  MIMO  crossover  for  o note  that  the  denominator  in 
(8.68)  must  be  bounded  away  from  zero.  This  motivates  definition  of  cross¬ 
over  for  (?2  as  being  that  frequency  for  which 
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Note  that  the  numerator  in  (8.69)  is  equal  to  |det[V  ,^U0]|.  Recall  the 
singular  values  of  V2HU2  are  e9ual  to  the  cosines  of  the  canonical  angles 
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between  the  subspaces  and  Denote  these  angles  by  a  and  a.  Then 


(8.69)  becomes 


cosa  •  cosa 


(8.70) 


Note  that  cosa  <  cos^  f  cosa.  Thus  (8.70)  gives  a  more  accurate  indication 

of  crossover  for  a ^  than  the  conditions  cosa  <  a2  -  cos2  imposed  in  Section 

8.2.  Also  note  that,  although  the  value  of  02  is  important  when  (8.70)  holds, 

©2  should  not  necessarily  be  bounded  away  from  ±ir.  This  follows  since  the 
*  H  *  h 

quantity  [-(v^  ^)(v2  u^)]  can  achieve  complex  values. 

The  results  of  this  section  have  shown  that  it  is  possible  to 
obtain  approximations  to  feedback  properties  for  higher  dimensional  systems. 
The  results  are  not  as  conclusive,  however,  and  this  area  definitely  needs 
further  research. 


8.7.  Summary 


The  purpose  of  this  chapter  has  been  to  generalize  one  of  the 
important  aspects  of  classical  feedback  theory  to  MIMO  systems.  This  aspect 
was  the  ability  to  approximate  feedback  properties  in  terms  of  the  open 
loop  transfer  function.  As  described  in  Section  8.5,  the  results  of  this 
chapter  may  be  used  to  obtain  rules  of  thumb  useful  in  design. 

The  results  may  also  be  used  to  gain  information  about  various 
design  procedures.  For  example,  linear  quadratic-state  feedback  designs 
are  guaranteed,  under  certain  conditions  (e.g.,  [4,  p.  13]),  to  possess 


the  property  that  o[S(jto)]  <  1,  V<u.  Some  implications  of  this  fact  are 
pointed  out  in  Section  7.3.  The  results  of  Section  3.2  show  that  it  is 
not  possible  to  use  this  design  procedure  to  obtain  a  design  with  strongly 
coupled  high  and  low  gain  subsystems. 

Finally,  note  that  the  results  of  this  chapter  are  based  upon 
algebraic  system  properties;  i.e.,  properties  which  are  related  at  a  given 
frequency  by  an  algebraic  equation.  Based  upon  these  results,  it  might  be 
conjectured  that  a  rule  of  thumb  for  good  design  is  that  coupling  between 
subsystems  with  different  levels  of  gain  is  always  to  be  avoided.  Certainly 
this  would  be  true  if  system  properties  at  different  frequencies  could  be 
manipulated  independently. 

The  various  Bode  integral  and  gain-phase  relations  show  that  system 
properties  at  different  frequencies  are  not  independent  even  for  SISO  systems. 
Thus  the  existence  of  coupling  between  high  and  low  gain  subsystems  might  be 
used  to  affect  the  value  of  some  system  parameter  which  might  not  appear, 
a  priori,  to  depend  upon  coupling.  This  is  best  demonstrated  by  the  example 
in  Section  8.4.  This  example  exhibited  feedback  properties  which  appeared  to 
violate  intuition  based  upon  SISO  Bode  gain-phase  relations.  In  order  to 
achieve  this,  however,  it  was  necessary  to  introduce  coupling  into  the 
system.  Thus  the  rule  of  thumb  conjectured  above  fails  when  feedback  prop¬ 
erties  constrained  by  the  analyticity  of  transfer  functions  are  considered. 
Describing  such  constraints  is  the  subject  of  the  remainder  of  this  thesis. 
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CHAPTER  9 

DIFFERENTIAL  EQUATIONS  FOR  SINGULAR  VALUES  AND  VECTORS 


9.1.  Introduction  and  Motivation 

In  Chapter  2,  several  Bode  Integral  relations  were  introduced  and 
their  implications  for  design  were  discussed.  In  Chapter  3,  one  Bode 
relation  was  generalized  and  new  integral  relations  were  presented.  Recall 
that  these  relations  show  not  all  design  specifications  are  achievable  and, 
therefore,  tradeoffs  among  desirable  system  properties  in  different  frequency 
ranges  must  be  performed. 

Although  the  Bode  integrals  are  precise  quantitative  statements, 
their  greatest  value  may  lie  in  the  qualitative  insight  they  afford  into 
system  properties.  This  insight  allowed  the  development  of  various  rules 
of  thumb  useful  In  feedback  design.  As  an  example,  consider  the  Bode  gain- 
phase  relation  (Theorem  2.2).  This  relation  shows  that  the  open  loop  gain 
cannot  be  rolled  off  much  faster  than  20  db/decade  near  crossover  frequency. 
The  weighting  function  (2.25),  however,  shows  that  the  value  of  phase  near 
crossover  is  relatively  independent  of  the  gain  behavior  at  frequencies  much 
more  than  a  decade  or  so  away.  (However,  see  [47]  for  discussion  of  a  case 
where  this  rule  of  thumb  fails  to  hold.)  In  any  event,  the  integral 
relations  and  interpretations  developed  by  Bode  provide  the  theoretical 
foundation  upon  which  the  practice  of  feedback  design  is  based. 

The  problem  of  deciding  whether  a  given  design  specification  is 
achievable  can  be  restated  as:  Under  what  conditions  is  a  given  function 
of  the  complex  frequency  variable  realizable  as  the  transfer  function  of  a 
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linear  time  invariant  system?  From  well-known  properties  of  the  Laplace 
transform,  it  follows  that  one  such  condition  is  that  the  function  must  be 
locally  analytic  in  the  frequency  variable.  This  fact  is  significant  in 
that  the  mathematical  cools  used  to  derive  the  Bode  integral  relations  are 
found  in  analytic  function  theory.  It  is  apparent  that  the  same  tools  must 
play  a  role  in  any  attempt  to  extend  these  integral  relations  to  MIMO 
systems . 

First,  note  one  way  of  viewing  the  Bode  gain-phase  relations 
(Theorem  2.2)  is  that  they  show  how  partial  knowledge  or  partial  specification 
of  a  transfer  function  can  suffice  to  completely  determine  the  function. 

Thus,  although  two  real  parameters  or  coordinates  are  necessary  to  describe 
a  transfer  function  (e.g.,  gain  and  phase),  there  is  in  essence  only  one 
degree  of  freedom  available  for  design. 

Arguments  based  upon  degrees  of  freedom  can  be  made  for  MIMO 

systems  as  well.  Consider  a  matrix  transfer  function  written  in  standard 

coordinates;  each  element  of  this  matrix  is  a  scalar  transfer  function. 

Hence,  the  Bode  gain-phase  relations  can  be  applied  to  each  of  these  scalar 

2 

functions  to  show  that,  of  the  2n  real  coordinates  needed  to  describe  a 
transfer  function  matrix  in  C  ,  there  exist  only  n  degrees  of  freedom. 

The  difficulty  with  this  observation  is  that  the  gain  and  phase  of  these 
elements  do  not  generally  yield  useful  information  about  feedback  properties. 

It  was  argued  in  Chapters  6-8  of  this  thesis  that  certain 
parameterizations  of  the  singular  value  decomposition  of  the  open  loop 


transfer  function  matrix  are  useful  in  assessing  feedback  properties. 


These  parameters  include  the  singular  values,  parameters  describing  the 

directions  of  system  inputs  and  outputs,  and  various  measures  of  phase 

difference.  In  Section  7.4,  two  parameterizations  of  transfer 
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function  matrices  in  C  were  presented.  Each  of  these  parameterizations 
involved  eight  real  coordinates;  the  above  argument  using  standard 
coordinates  and  scalar  gain-phase  relations  shows  that  these  eight  new 
coordinates  must  also  only  represent  four  degrees  of  freedom.  An  example 
giving  some  indication  of  the  relation  among  these  parameters  was  presented 
in  Chapter  8,  Section  8.4.  The  fact  that  these  parameters  do  appear  to 
be  related  to  feedback  properties  motivates  study  of  how  the  property  of 
local  analyticity  is  preserved  under  coordinate  transformations. 

The  most  useful  statement  of  the  dependence  among  system  parameters 
is  given  by  the  gain-phase  relations.  There  are  other  equations  which  must 
be  satisfied  by  analytic  functions  which  are  more  simple  to  treat  mathemat¬ 
ically  than  are  the  gain-phase  relations .  These  include  the  Cauchy-Riemann 
equations,  Laplace's  equation,  and  Cauchy's  integral  formula.  The  manner  in 
which  these  equations  are  preserved  after  coordinate  transformations  will 
thus  be  investigated  first. 

The  remainder  of  this  chapter  is  organized  as  follows.  In  Section 
9.2  some  elementary  properties  of  analytic  functions  are  reviewed  and  the 
question  of  whether  singular  values  possess  any  of  these  properties  is 
addressed.  In  Section  9.3,  formulas  for  the  derivatives  of  singular  vectors 
are  presented.  It  is  shown  that  singular  values  and  singular  vectors  are 
not  independent  quantities.  Knowledge  of  singular  values  indeed  determines 
some  properties  of  the  singular  vectors.  This  relationship  will  be  inves¬ 
tigated  in  detail  in  later  chapters  of  this  thesis. 


The  mathematical  tool  used  by  Bode  in  deriving  the  integral 


relations  discussed  in  the  previous  section  was  complex  variable  theory. 

This  theory  was  applicable  since  transfer  functions  are  locally  analytic 
in  the  complex  frequency  variable.  It  is  not  apparent,  however,  that  the 
same  tools  may  be  applied  to  study  singular  values  of  matrix  transfer 
functions.  Since  singular  values  are  real  they  cannot,  by  definition,  be  even 
locally  analytic  functions  of  s. 

Recall  that  one  interpretation  of  the  scalar  gain-phase  relations 
emphasized  that,  along  the  jm-axis,  knowledge  of  the  gain  of  a  transfer 
function  suffices  to  determine  the  phase  as  well.  This  follows  since  gain 
and  phase  are  not  independent  quantities  under  the  conditions  of  Theorem  2.2. 
The  last  observation  motivates  the  question:  Does  knowledge  of  a  singular 
value  of  a  transfer  function  matrix  suffice  to  determine  any  other  properties 
of  the  matrix? 

The  Bode  gain-phase  relations  express  the  interdependence  of 
scalar  gain  and  phase  in  the  form  which  is  most  useful  in  applications  to 
feedback  theory.  It  is  mathematically  more  tractable,  however,  first  to 
study  this  interdependence  as  expressed  by  the  Cauchy- Riemann  equations  and 
properties  of  harmonic  functions .  To  do  this  requires  some  results  from 
complex  variable  theory.  Background  information  is  found  in  [26]  and  [48]. 
Definition  9.1.  [48,  p.  34]:  Let  D  be  an  open  subset  of  C,  the  complex 
plane.  Then,  a  function  f  :  D  +  C  is  analytic  if  f  is  continuously 
differentiable  in  D;  i.e.,  if  the  derivative  of  f  with  respect  to  s 
exists  and  is  continuous  at  all  points  of  D.  ■ 


Definition  9.2.  [48.  p.  40]:  A  region  is  an  open  connected  subset  of  the 

complex  plane.  ■ 

Theorem  9.3.  [48,  p.  42]:  Let  g  and  h  be  real-valued  functions  defined  on 

a  region  D  of  the  complex  s-plane,  s*x+jy.  Suppose  that  g  and  h  have 
continuous  first  partial  derivatives.  Then,  the  function  f  :  D  -+■  C,  defined 
by  f(s)  ■  g(s)  +  jhCs),  is  analytic  if  and  only  if  g  and  h  satisfy  the 
Cauchy-Riemann  equations: 

la  „  3h 

3x  "  3y 

li  .  -  U» 

3y  3x 


Equivalently , 


3f  =  1  3f 
3x  =  j  3y 


(9. 


If  f(s)  is  analytic  and  nonzero  on  some  region  D  then, 

log  f  =  log | f |  +  j*  f  (9.: 


is  also  analytic  on  D.  Moreover,  the  log  magnitude  and  phase  of  f  satisfy 
the  Cauchy-Riemann  equations : 


Equations  (9.3)  can  be  used  to  show  that  knowledge  of  the  gain  of 


a  transfer  function  f(s)  in  a  simply  connected  region  D  where  f(s)  is  analytic 
and  nonzero  suffices  to  determine  the  phase  of  f(s)  up  to  an  arbitrary 
constant.  Given  the  value  of  4  f  at  some  point  sq€D,  the  value  at  any 
other  point  s€D  can  be  computed  by  a  line  integral.  Let  y:  [0,1]  D  be  a 
piecewise  smooth  path  with  y(0)  ■  sq  and  y(l)  ■  s.  Then,  4  f(s)  can  be  found 
by  integrating  the  differential  form 


d*  f 


3*  f 
3x 


dx  + 


3*  f 
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3 log | f | 
3x 
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(9.4) 


along  y .  Thus , 


*  f(s) 


4  f(s  )  +  /  d*  f 
o  4 
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4  f (so)  + 


1  p*4_f  3x 

J  L3x  3t 


34  f 
3y  3t_ 


dt 


(9.5) 


Using  the  Cauchy-Riemann  equations  shows  that  the  integral  in  (9.5)  is 
determined  by  | f | : 

4£(S)  -  «(s0)  +  Jl  [- If  +  |i]  dt  .  (9.6) 

The  fact  that  4  f(s)  is  well-defined  in  a  simply  connected  region 
follows  from  the  fact  that  the  line  integral  in  (9.5)  is  independent  of  the 
path  y.  Path  independence  in  a  simply  connected  region  is  guaranteed,  provided 
that  the  differential  form  (9.4)  satisfies*  the  condition 


The  elementary  properties  of  differential  forms  used  in  this  chapter  can 
be  found  in  [49]. 
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d^f  «  0, 


d2*  f 


/32*  f 
l  3y3x 


32*  f 


dxAdy 


(9.7) 


Note  that  (9.7)  is  equivalent  to  the  familiar  condition  that  mixed  second 
partial  derivatives  are  equal. 

From  the  Cauchy-Riemann  equations  it  follows  that  (9.7)  is  also 
equivalent  to  the  requirement  that  log | f |  satisfy  Laplace's  equation; 


(9.8) 


Definition  9. A.  f48 .  P.  411:  A  function  with  continuous  second  partial 
derivatives  satisfying  Laplace's  equation  is  said  to  be  harmonic.  ■ 

If  D  is  a  simply  connected  region  and  g  :  D-*H  is  harmonic  on  D,  then  it 
can  be  shown  [48,  p.  43]  that  there  exists  another  harmonic  function 
h  :  D  -*■  TR  such  that  the  functio..  f  :  D-*-<C  defined  by  f  *  g  +  jh  is  analytic 
on  D.  Such  a  function  is  termed  a  harmonic  conjugate  of  g.  It  is  a  fact 
that  the  real  part  of  any  function  analytic  on  some  region  D  is  harmonic  on 
D  and  the  imaginary  part  of  the  analytic  function  is  a  harmonic  conjugate. 

In  particular,  on  simply  connected  regions  over  which  the  logarithm  of  a 
transfer  function,  log  f(s),  is  analytic,  it  follows  that  the  phase,  *f(s), 
is  a  harmonic  conjugate  of  the  log  magnitude,  log j  f (s ) [ . 

Since  singular  values  are  a  generalization  of  gain  to  MIMO  systems, 
one  might  conjecture  that  each  singular  value  is  the  magnitude  of  some 
locally  analytic  function  (as  is  the  gain  of  a  scalar  transfer  function) . 
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From  the  preceding  discussion  this  conjecture  would  be  true,  provided  that 
log  is  locally  a  harmonic  function.  The  following  example  shows  the 
conjecture  is  false,  however. 

Example  9.5:  (Due  to  Doug  Looze.) 

Let 


M(s) 


s  0 

2  s 


The  singular  values  of  M(s)  are  given  by 

-  1  +  /\s\2  +  1 

Oj  ■  -1  +  /is|2  +  1 


from  which  it  may  be  verified  that 


2  2 
3  logo  3  logo - 


3 


2  2 
3  logo-  3  logo - 


Proof:  See  Appendix  F.  ■ 

The  fact  that  log  o ^  is  not,  in  general,'  harmonic  shows  that 
o^  cannot,  in  general,  be  the  magnitude  of  some  analytic  function.  In 
particular,  knowledge  of  o^  in  a  simply  connected  region  does  not  suffice 


to  determine  the  value  of  a  harmonic  conjugate  in  the  same  way  that  the  log 
magnitude  of  a  scalar  transfer  function  determined  the  phase  of  the  function. 
Nonetheless,  it  can  be  shown  that  knowledge  of  a  singular  value  of  a  matrix 
transfer  function  does  determine  other  properties  of  the  matrix.  This  will 
be  seen  from  expressions  for  the  derivative  of  a  singular  value. 

Since  singular  values  are  not  analytic  in  the  complex  frequency 
variable  s  =  x  +  jy,  it  follows  by  definition  that  the  derivative  of  a 
singular  value  with  respect  to  s  does  not  exist.  Partial  derivatives  with 
respect  to  x  and  y  do  exist,  however,  although  some  care  must  be  taken  for 
the  case  of  multiple  singular  values.  A  thorough  discussion  of  the  existence 
of  partial  derivatives  of  singular  values  and  vectors  is  contained  in 
Appendix  G.  To  illustrate  the  discussion  of  this  section  and  Section  9.3, 
it  is  sufficient  to  consider  only  distinct  singular  values.  Proof  of  the 
existence  of  derivatives  is  deferred  to  Appendix  G.  The  following  formula 
for  the  derivative  of  a  singular  vector  was  originally  derived  in  [10]  but 
is  given  here  for  the  sake  of  completeness. 

Recall  (Appendix  B)  that  the  square  of  each  singular  value  is  an 
eigenvalue  of  the  matrix  [M(s)]^M(s)  with  eigenspace  given  by  the  right 
singular  subspace: 

[M(s) ]HM(s)ui(s)  =  a^ui(s)  .  (9.9) 

From  (9.9)  it  follows  that  (suppressing  dependence  on  s)  : 


Since  ui  is,  by  definition,  a  unit  vector: 
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u  u  =  1 
i  i 


a  H 
3ui  ui 


3x 
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■  V 
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r  _  H  „ 

3ui  A  H  3ui 

Ui  3x 
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3x  ui  ui  3x 
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Substituting  (9.11)  into  (9.10)  yields 
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i  3x 
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*4-  K  +  MH 
3x  3x 


3ai  T  H  3M 

~1Z  =  Re  Lvi  ui_ 


Similarly, 
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Re 


H  3M  I 

'i  3y  UiJ 


(9 


(9 


(9 


The  fact  that  M(s)  is  locally  analytic  implies  that  each  element 
of  M(s)  must  satisfy  the  Cauchy-Riemann  equations.  This  in  turn  yields: 
Lemma  9.6:  Let  u  and  v  be  vectors  in  (jn.  Then  if  each  element  of 


M(s)  is  analytic  at  s  =  x  +  jyej,it  follows  that 


f  H  3M  |  _  r  H  3M  ”1 

Lv  s^uJ  *  ImLv  S7UJ 

r  H  3M  1  T  r  H  3M  1 
v  —  u  ■  -Im  v  —  u 

L  3y  J  L  J 


Proof :  The  Cauchy-Riemann  Equations  (9.1)  yield: 


3M  _  1  3M 
9x  j  3y 


H  3M  1  H  3M 
v  3x  U  “  j  V  3y  U 


Taking  real  and  imaginary  parts  of  (9.15)  yields  the  result.  ■ 

Note  the  similarity  of  Equations  (9.14)  to  the  Cauchy-Riemann 
Equations  (9.1).  Together  with  Equations  (9.12)-(9.13) ,  Lemma  9.6  shows 
that  each  singular  value  of  a  matrix  locally  analytic  in  s  determines 
a  differential  form  defined  by  (dropping  the  subscript  "i"  for  convenienc 


.  A  1  T  P  H  3M  .1  _  f  H  3M  1. 

A  =  o  Im  LV  3^  UJ  dX  a  lm  LV  97  Uj  dy 

-1  „  I-  H  3M  1.  .1  „  r  H  3M  1. 

■  7  Re  |_V  37  “J  ^  +  7  Re  1_V  3^  “J  iy 
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Recall  that  a  differential  form  (9.4),  similar  to  (9.16),  was 
found  to  relate  the  gain  and  phase  of  a  scalar  transfer  function.  Again, 
one  might  conjecture  there  exists  some  function  0  such  that  AEd0.  By  th 
Poincard  Lemma  [49  ,  p.  94],  such  a  function  0  exists  if  and  only  if 
dA*0.  But,  as  Example  9.5  has  shown,  this  is  generally  impossible;  the 


and  this  is  not  generally  true. 


The  preceding  discussion  shows  that  the  differential  form  (9.16)  does 
not  arise  from  the  differential  of  any  function  (in  geometric  language,  [49], 
the  form  is  not  exact).  In  particular,  there  exists  no  function  6  for  which 
logo+  j 9  is  locally  analytic.  Hence  o  cannot,  in  general,  be  the  magnitude 
of  a  locally  analytic  function.  Nonetheless,  knowledge  of  a  singular  value 
does  determine  other  properties  of  the  matrix  M.  Thus  one  may  now  pose 
the  question:  What  is  the  significance  of  the  differential  form  A  in  (9.16) 
which  is  determined  by  a(x,y)?  To  answer  this  question  requires  a  discussion 
of  the  derivatives  of  the  singular  vectors. 


9.3.  Derivatives  of  Singular  Vectors 

In  this  section,  formulas  for  the  partial  derivatives  of  singular 
vectors  are  presented.  As  in  Section  9.2,  questions  of  existence  are 
relegated  to  Appendix  G;  similarly,  the  discussion  in  this  section  is 
restricted  to  singular  values  of  multiplicity  one. 

First,  recall  the  one  degree  of  freedom  which  exists  in  choosing 
each  pair  of  left  and  right  singular  vectors.  This  degree  of  freedom  is 
discussed  in  detail  in  Appendix  B  but  is  reviewed  here  for  convenience. 

Corresponding  to  each  singular  value  a ^  of  multiplicity  one 
there  exist  uniquely  defined  one-dimensional  right  and  left  singular 
subspaces  denoted  11.  and  0.,  respectively.  These  subspaces  are  determined 
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by  a  min-max  principle  inherited  from  properties  of  Hermitian  matrices. 

One  can  choose  the  right  singular  vector  as  an  arbitrary  unit  vector 
u^€  U^.  if  j  0,  then  the  left  singular  vector  is  uniquely  determined 

by  the  conditions  that  v^€l?^  and  that  o^  =  v^^Mu^  is  a  positive  real 
number.  To  illustrate  the  degree  of  freedom  available,  let  u^  be  one  choice 
of  right  singular  vector  and  let  v^  be  the  corresponding  left  singular 
vector.  Then,  choosing  a  different  right  singular  vector  u^  =  eJ  u^  yields 
a  different  left  singular  vector  v^  ■  e^°v^. 

Now,  consider  the  singular  vectors  along  a  curve  in  the  complex 
plane:  y(t)  *  x(t)  +  jy(t) .  Since  it  was  assumed  that  the  singular  values  are 
distinct  along  y(t),  it  follows  by  continuity  that  this  property  holds  in  a 

neighborhood  of  each  point  in  the  image  of  y.  The  results  of  Appendix  G  can 

then  be  applied  to  show  that  the  singular  vectors  may  be  chosen  to  be  differen¬ 
tiable  in  (x,y)  in  some  neighborhood  of  etch  point  in  the  image  of  y.  One  can 
think  of  the  motion  of  a  singular  vector  along  y(t)  as  being  decomposed  into  two 
components.  One  component  measures  how  the  appropriate  singular  subspace  is 
changing  along  y  and  is  well  determined  by  the  min-max  characterization  of 
singular  values  and  vectors.  The  other  component  is  a  function  of  which  unit 
vector  in  the  subspace  is  being  "chosen"  as  the  singular  vector.  Note  that 
this  component  would  be  arbitrary  (subject  to  smoothness)  if  the  convention 
that  ■  v^^Mu^  be  positive  real  were  not  imposed;  obviously,  *  |v  j 
for  any  unit  vectors  in  the  appropriate  subspaces.  A  helpful  picture  to  keep 

in  mind  is  that  of  a  vector  which  moves  along  a  path  in  space  (motion  of  the 

singular  subspace)  as  well  as  "spins"  as  it  moves  (motion  within  the  singular 
subspace).  This  analogy  will  be  made  precise  in  Chapter  10.  First,  formulas 
for  partial  derivatives  of  singular  vectors  will  be  discussed. 
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Let  and  be  right  and  left  singular  vectors  corresponding 
to  a  singular  value  a ^  of  multiplicity  one.  Write  the  derivatives  of 
u^  and  v^  with  respect  to  x  as  (a  similar  formula  holds  for  the  derivatives 


with  respect  to  y) 
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Using  various  properties  of  the  singular  values  and  vectors  yields,  for 
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Similarly, 
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From  (9.11)  it  follows  that 


and 

*•  [viH  ]  * 0 

f  H  3uil  r  H  3vi“ 

It  remains  to  calculate  Im  — —  and  Im  v^  — —  . 


(9.21)  V. 


From  the  identity 


Note  that  the  right  hand  side  of  (9.22)  shows  that  the  left  hand  side  is 
independent  of  the  choice  of  singular  vectors. 

The  degree  of  freedom  available  in  the  choice  of  singular  vectors 
manifests  itself  in  the  differential  equations  governing  their  behavior.  To 
see  this,  consider  a  path  y  -  x  +  jy  :  [0,l]-*-C.  In  Appendix  G  it  is  shown, 
using  results  of  Kato  [32],  that  there  exists  a  unit  vector  u^(t)  defined 
along  y(t)  with  the  properties  that  u^t)  is  analytic  in  t  and  that  ui(t)€ 
IL(t)  ,  the  ith  right  singular  subspace.  Thus  u^(t)  is  one  possible  choice 
of  the  ith  right  singular  vector.  Let  8(t)  be  an  arbitrary  real  valued 
function  analytic  in  t.  It  follows  that  another  choice  of  ith  right  singular 
vector,  also  analytic  in  t,  is  given  by 

ut(t)  -  ejB(t)  ut(t) 
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A  jj  a  II 

Since  and  are  unit  vectors,  u^  -j£-  and  u^  must  be  purely  imaginary 


Hence  the  lack  of  uniqueness  in  choosing  the  right  singular  vector  indeed 

manifests  itself  in  the  differential  equation  governing  the  vector.  The 

right  singular  vector  may  be  chosen  so  that  the  component  of  its  derivative 
H  ^ui 

given  by  u^  is  any  arbitrary  function  of  t,  subject  only  to  differen¬ 

tiability  requirements. 

It  remains  to  verify  that  this  Is  indeed  the  only  degree  of  freedom 


available  in  choosing  the  differential  equations  governing  the  singular  vector 


du 

H  i 

pair.  To  see  this,  first  note  that  once  the  component  u^^  is  specified 

H  dvi 

it  follows  that  v^  — is  completely  determined  from  the  equation 
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dui  dvi 

Second,  note  that  -r—  and  -rr-  are  not  functions  of  the  choices  of 

at  dt  - 


right  and  left  singular  vectors  u^  and  v^,  k#i.  This  follows  since  the 


H  H 

projection  operators  u^u^  a°d  vkvk  *n  (9.17)  and  (9.18)  are  functions  only 


of  the  singular  subspaces  and 


To  summarize,  one  can  construct  a  pair  of  left  and  right  singular 
vectors  along  any  path  y(t)  (over  which  the  corresponding  singular  value  is 
distinct)  by  integrating  the  differential  equations 

dt  \<U1C  dt  > 

dvi  ;  ,  Hdvi. 

it  ■  J,  ir) 

k“l 

H  dui  H  dvi 

In  these  equations  the  terms  11^(11^  and  vfc(v  ~^)  for  k^i  are  completely 

determined  from  (9.19)  and  (9.20)  and  similar  equations  in  y.  The  terms 
H  d^i  H  dvi 

u^(ui  -j^-)  and  vi^vi  ”dt  ^  Possess  one  degree  of  freedom.  Once  this  degree 

&  a  '  H  dui  H  dVi 

of  freedom  is  utilized  in  choosing  the  component  u,  — —  or  v^  — —  (or  some 

i  at  1  at 

combination  thereof),  the  remaining  component  is  completely  determined  from 
(9.22)  and  a  similar  equation  in  y. 

ft 

Given  an  initial  condition  u . (0) ,  let  u, (t)  denote  the  right  sin- 

H  d"i 

gular  vector  obtained  by  setting  u^  =  0  and  integrating  the  resulting 

differential  equation  along  y(t).  Note  that,  of  all  possible  choices  of  u  (t) 

*  lduil 

along  y,  the  choice  u^(t)  yields  the  minimum  possible  magnitude  of  |  — ^  | ^ .  This 

du* 

.  H 

follows  since  u*  — e  0  by  construction  and  the  other  components  can  change 
only  by  a  multiple  of  e^13^  for  some  real  valued  6(t).  Thus,  u*(t)  changes 
along  y  only  enough  to  remain  in  the  correct  singular  subspace  and  no  more. 

With  this  choice  of  right  singular  vector  the  left  singular  vector 
is  completely  determined;  from  (9.22) 


H  dVi  1 
vi 


f  HdM 
Vi  dt  Ui 


(9.23) 


The  magnitude  of  —  Im[v^  —  u^j  indicates  how  much  the  left  singular  vector 


chosen  as  described  deviates  from  the  "minimum  energy"  left  singular  vector 


H  dvi 

obtained  by  setting  v.  —7—  -  0. 

1  at 


Of  all  possible  choices  of  singular  vectors,  the  choice  such  that 
dvi  H  dui 

~ ^  ■  0  or  u^  — ■  0  yields  the  minimum  possible  value  of 
qi  subject  to  (9.22).  This  fact  should  have  significance  since 


either  v 
du 


1  T  r  H  dM  . 
-Im[v±  jj-uJ 


J-  r„r„  «  >!  „  ,*  x  1  ,  ,  >  »  ,dy 
^Infvi  S  “i’dF  +  37  Intvi  -57uJdt 
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i  dx 
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3y  dt 


i  Qi. 

3x  dt 


(9.24 


A  A 
ildt; 


where  the  last  equality  follows  by  evaluating  the  differential  form  (9.16) 
on  the  tangent  vector  to  the  curve  y. 

Thus  the  quantity  ~  Imtv^  ~  u^]  ,  which  measures  the  deviation  of 
the  singular  vector  pair  from  the  pair  which  moves  just  enough  to  remain  in 
the  proper  singular  subspaces,  is  determined  by  the  singular  value  ck. 

In  answer  to  the  question  posed  at  the  end  of  Section  9.2  it  can  be 
said  that  the  singular  value  determines  some  property  of  the  singular 
vector  pair. 

Now,  recall  the  two  components  of  singular  vector  motion  discussed 

3 

above.  An  analogy  was  made  between  these  two  components  and  vectors  in  1R 

which  move  in  space  as  well  as  "spin"  as  they  move.  If  the  spin  component 

H  dui 

can  be  identified  with  u^  — - then  setting  this  component  equal  to  zero 
implies  the  vector  is  not  spinning  as  it  moves.  The  following  discussion 
will  hopefully  clarify  this . 


One  can  consider  spinning  as  a  generalization  of  the  following 


situation.  Suppose  that  along  y(t)  the  right  singular  subspace  is 

H  ^ui  1 9 ( t ) 

constant  (u.  *  0,  i?4  j).  Then,  u^t)  ■  eJ  '  ;  u^  where  u^  is  constant 

H  ^ui  d9 

and  u^  — ■  j  The  change  in  0  along  Y  can  be  thought  of  as  spinning 

or  twisting  of  the  singular  vector.  (Think  of  a  pencil  being  held  in  a 

fixed  direction,  but  being  twisted  about  its  lengthwise  axis.)  Now  it  is 

H  dui 

clear  that  for  the  case  of  a  constant  singular  subspace,  u.  — — “0~u,(t)  * 

-  1  at  l 

constant.  Now,  intuition  suggests  that  a  motion  analogous  to  spinning  takes 
place  even  if  the  singular  subspace  is  not  constant.  In  particular. 


H  dui 


H  dUi 


u^  — —  should  be  a  measure  of  the  spinning  and  u^  “  0  should  mean  that 

u^(t)  is  not  spinning.  Consider  a  thought  experiment:  Suppose  Y(t)  is  a 

closed  curve;  i.e.,  Y  '  (0,1]  -►  C,  y(0)  -  y(1).  If  the  singular  vector 

does  not  spin  then  will  u. (1)  »  u,  (0)  when  (9.17)  is  integrated  with 
H  dui 

u^  — 0?  Clearly,  this  will  be  the  case  if  the  singular  subspace  is 

constant;  thus,  if  u^l)  j4  u^(0)  it  must  be  due  to  the  motion  of  the 

singular  subspace  along  y.  These  questions  will  be  pursued  shortly. 

In  earlier  chapters  it  was  argued  that  certain  measures  of  phase 

difference  between  singular  vectors  have  interpretations  in  terms  of 

feedback  system  properties.  The  fact  that  each  singular  value,  via  (9.24), 

constrains  a  quantity  which  determines  the  difference  in  components 
H  dui  H  dvi 

u^  — and  v^  — —  suggests  that  this  measure  of  phase  difference  might  be 
related  to  these  components.  Thus  in  the  attempt  to  generalize  Bode 


gain-phase  relations  to  MIMO  systems  it  would  appear  that  understanding  the 

property  of  singular  vector  pairs  governed  by  —  Im[v^  ^  u^]  may  be  of 

i 

significance . 


9.4.  Summary 

The  purpose  of  the  preceding  discussion  has  been  to  motivate 
further  investigations  of  the  property  of  each  singular  vector  pair 
determined  by  the  singular  value.  To  summarize,  in  SISO  systems,  once 
gain  is  specified  then  phase  is  determined  and  this  has  important  design 
implications.  In  MIMO  systems  gain  and  gain  directions,  as  quantified  by 
singular  values  and  subspaces,  as  well  as  certain  measures  of  phase  difference 
between  singular  vectors  have  design  implications.  Now,  each  singular 
value  determines  some  property  of  the  associated  singular  vector  pair; 
intuitively  this  should  have  something  to  do  with  the  measure  of  phase 
difference  between  the  pair  of  singular  vectors.  In  order  to  pursue 
this  question,  it  is  convenient  to  assign  coordinates  to  the  singular 
vectors  and  to  use  these  coordinates  to  study  the  solutions  to  the 
differential  equations  governing  these  vectors.  This  is  pursued  in  the 
next  chapter. 

Finally,  a  formula  for  the  Laplacian  of  the  logarithm  of  a 
singular  value  will  be  presented.  For  simplicity  it  will  be  assumed  that 
the  singular  values  are  all  distinct. 


Theorem  9.7:  Assume  that  M(s)  is  a  matrix  of  transfer  functions  taking 

values  in  Cn*n  and  assume  that  each  element  of  M(s)  is  analytic  at  s  . 

o 

In  addition  assume  that  M(sq)  is  nonsingular  and  .that  the  singular 

values  of  M(s  )  are  distinct  and  ordered  so  that  a,  >o»>...>a  >0.  Then 
°  l  t  n 

the  Laplacian  of  the  logarithm  of  each  singular  value  is  given  by 


k*i 


3x  ui' 
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on  I . 

i7“kl 


*1 


Proof :  See  Appendix  H.  ■ 

Note  (9.25)  proves  that,  in  general,  log  is  not  a  harmonic 

2 

function.  In  particular,  at  points  for  which  V  logo^  <  0,  it  follows  that 

log  is  locally  superharmonic  [50  ] .  This  means  that  can  have  local 

2 

maxima  but  no  local  minima.  Similarly,  if  7  logc^  >0,  it  follows  that 

log  is  locally  subharmonic  and  can  have  local  minima  but  no  local  maxima. 

By  contrast,  recall  that  harmonic  functions  can  have  no  local  maxima  or 

minima.  Note  that  the  largest  singular  value  is  always  subharmonic  and  that 

the  smallest  singular  value  is  always  superharmonic. 

A  final  motivation  for  the  additional  study  of  singular  vectors 

may  be  obtained  by  comparing  (9.25)  to  the  formulas  (9.17)-(9.20)  for  the 

derivatives  of  the  singular  vectors.  Obviously,  if  the  singular  subspaces 
H  3vi  H  3vi  H  3ui  H  3ui 

are  constant,  vfc  -rj-,  vfc  u^  and  ufc  — ,  Vk^i,  must  all  equal 

zero.  From  (9.19)  and  (9.20),  it  follows  that  v^  u^,  etc.,  must  also 

2 

equal  zero.  Consequently,  V  logo^  *  0,  Vi.  Thus  properties  of  singular 
values,  (i.e.,  the  extent  to  which  log  fails  to  be  harmonic)  are  related 
to  that  component  of  singular  vector  motion  given  by  the  motion  of  the 


singular  subspaces. 


CHAPTER  10 


MATHEMATICAL  STRUCTURE  OF  SINGULAR  VECTORS,  I 


10.1.  Introduction 

The  purpose  of  this  chapter  is  to  develop  a  mathematical  framework 
within  which  to  study  properties  of  the  singular  vectors  obtained  from  the 
singular  value  decomposition  of  a  matrix  transfer  function.  Once  these 
properties  have  been  described  they  will  be  used,  in  Chapter  11,  to 
explain  the  meaning  of  the  differential  form 


H  ^i  H  3ui 
Vi  3x  “Ui  3x 
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which  was  introduced  in  Chapter  9. 

First,  it  will  be  shown  that  the  unit  sphere  in  In,  the  space 
in  which  singular  vectors  lie,  has  the  mathematical  structure  of  a 
principal  fiber  bundle.  Using  this  structure,  various  notions  of  the  phase 
of  a  singular  vector  and  the  phase  difference  between  a  pair  of  singular 
vectors  can  be  introduced.  A  complete  set  of  coordinates  for  the  singular 
vectors  may  be  obtained  from  the  phase  coordinates  just  described  as  well 
as  a  set  of  coordinates  for  the  complex  projective  space  (EPn  *.  This  space 
consists  of  the  directions  in  (En,  a  concept  introduced  in  Chapter  6. 
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10.2.  Fiber  Bundles  and  Singular  Vectors 


In  order  to  study  the  differential  equations  governing  the  motion 
of  singular  vectors,  it  is  first  necessary  to  describe  the  structure  of  the 
space  in  which  these  vectors  lie.  The  relevant  space  is  the  unit  sphere  in 
<En;  in  this  section  it  is  shown  that  the  unit  sphere  has  the  structure  of  a 
principal  fiber  bundle.  This  structure  naturally  allows  a  unit  vector  to 
be  described  by  giving  its  direction  (see  Chapter  6)  as  well  as  a  parameter 
analogous  to  phase. 

First,  the  set  of  directions  in  (ln  will  be  described  as  points 
in  complex  projective  space,  a  manifold  with  complex  dimension  (n-1) . 
Definition  10.1  [51,  p.  47;  52,  p.  114]:  For  each  integer  n  identify  ]R  n 
with  (En  and  denote  the  points  by  z-  (z^,...,zn).  Then  the  unit  sphere 
S2n  *C(En  is  given  by 

S2n-1  -  {  (z. ,  •  •  • ,  z  )|  ?  !  z  .  |  2  =»  1 }  -  (10.2) 

1  n  i.j.  i 

Define  an  equivalence  relation  on  S2n  ^  by  z-z'  if  and  only  if  there 
exists  a  complex  number  X  with  |x|  *  1  such  that  z-\z'.  The  quotient  space 
S  is  denoted  by  IP  and  is  termed  (n-1) -dimensional  complex  pro¬ 

jective  space.  Given  z€S2n  ^  the  corresponding  point  in  (IPn  *  is  denoted 

[*].  ■ 

The  vectors  in  a  given  equivalence  class  -can  be  distinguished  by 


assigning  to  each  a  number  analogous  to  phase.  This  may  be  done  by  choosing 
an  arbitrary  vector  z  in  the  equivalence  class  and  defining  the  phase  of 
other  vectors  z£[z]  by  the  rule 


© 


In  particular  each  equivalence  class  may  be  placed  in  one-to-one  corre¬ 


spondence  with  the  unit  circle  S *  by  defining  a  point  on  the  unit  circle 

1  0 

in  the  complex  plane  by  eJ  ,  9£(-iT,7r]. 

The  above  observations  may  be  used  to  show  the  unit  sphere  in  (In 
has  the  structure  of  a  principal  fiber  bundle.  A  layman's  description  of 
fiber  bundles  and  their  application  to  physics  is  found  in  [53];  a  precise 
treatment  is  found  in  [54].  The  basic  definitions  and  properties  of  fiber 
bundles  are  now  described.  Following  the  mathematical  preliminaries  a  dis¬ 
cussion  of  their  relevance  to  the  study  of  singular  vectors  will  be  given. 

The  following  definitions  are  taken  from  [54],  In  the  interest  of  brevity, 
not  all  terms  will  be  defined;  however,  references  to  the  relevant  pages  of 
[54]  will  be  given. 

Definition  10.2  [54,  p.  26]:  A  principal  fiber  bundle  (PFB)  consists  of  a 
manifold  P  (called  the  total  space) ,  a  Lie  group  G  [54,  p.  18],  a  base 
manifold  M,  and  a  projection  map  tt  :  P-*-M  such  that  (A),  (B)  ,  and  (C)  are 
satisfied. 

(A)  The  group  G  acts  freely  and  dif f erentiably  on  P  to  the  right 

[54,  p.  26]. 

(B)  The  map  ir  :  P  -*-M  is  onto  and  it  *(iT(p))  =  {pg:g€:G}.  If  x£M, 
then  t  *(x)  is  called  the  fiber  above  x. 

(C)  For  each  x£M  there  is  an  open  set  with  x€  and  a  diffeo- 

morphism  [54,  p.  7]  T  :  tt-1  (U  ) -►  U  *G  of  the  form  T  (p)  =  (*(p), 

a  a  a  ci 

s  (p))  where  s  :  ir-1(U  ) -*  G  has  the  property  s  (pg)=s  (p)g  for  all 

0t  CC  01  ut  Cl 

g€  G,  p€7r-1(u  ).  The  map  T^  is  called  a  local  trivialization.  ■ 
Note  (C)  states  that  locally  the  PFB  must  look  like  the  product  of 


the  base  space  with  the  Lie  group. 


Definition  10.3  [54,  p.  271:  Let  T.  :  tr  (U  )-*•!]  *G  and  T„  (U„)-HJ„xG  be 

"  "  1  '  "■  '  1 "  *“  '  1  01  Cl  Ot  p  p  p 

two  local  trivializations  of  a  PFB  it  :  P-*-M  with  group  G.  The  transition 

function  from  T  to  T„  is  the  map  g  „  :  U  <"iu„-K5  defined,  for  x=tt(p)Gu  no., 
-  a  g  r  °ag  a  g  r  a  g 

by  g  (x)  ■  s  (p) [sQ (p) l”1.  Note  that  g  (x)  is  independent  of  the  choice  of 

Cip  01  p  OLp 

p  €  if  *  (x)  because 

s  (pg)[s  (pg)]”1  -  s  (p)g[s  (p)g]-1  -  s  (p)[s  (p)]"1. 
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The  following  properties  hold: 

(i)  gaa (y)  «e  for  all  y£Ua,  where  e  is  the  identity  element  of  the 
group  G. 

(ii)  gga(y)  -  [g^Cy)]”1  for  all  ySu^Ug. 

(iii)  ga0(y)8gY(y)8Ya(y)-e  for  all  ye^nu^.  ■ 

The  transition  functions  describe  how  the  products  Ua*G,Ug*G,  etc. 

piece  together  to  form  the  total  space  P.  Thus  P  may  be  considered  as  the 

space  obtained  from  the  disjoint  union  (Ua*G) U  (UgxG) U  . . .  by  identifying 

the  point  (x,g)€U  *G  with  (x.g')eu  *G  if  g=g  (x)g'. 

a  pap 

Definition  10.4  [54,  p.  27]:  A  local  section  of  a  PFB  tt  :  P-*-M  with  group  G 

is  a  map  p  :U  -*-P  (UCM,U  onen)  such  that  ir°p  *  1  =  the  identity  function 
a  a  a  a  a  a 

on  U  .  ■ 

a 

Theorem  10.5  [54,  p.  27]:  There  is  a  natural  correspondence  between  local 

sections  and  local  trivializations.  ■ 

Proof:  If  p  :D  +  P  is  a  local  section,  then  define  T  :  it  ^(U  )-*-u  xG  bv 
-  "a  a  a  a  a 

Ta  (pa(x)g)  =  (x,g) .  Conversely,  given  a  local  trivialization 

T  XG,  define  a  local  section  p  :  U  -*-P  byp  (x)  =  T  ^(x,e),  where 

a  a  a  a  a  a  a 


e  is  the  identity  element  of  G. 


Definition  10.6  [54,  p.  27]:  If  is  a  local  trivialization  with 

Ua~M(L.e.,  :  P  +  MxG),  then  is  called  a  global  trivialization,  and  the 

PFB  is  called  trivial  if  such  a  T..  exists.  A  local  section  p  :U  -*-P  is 
— — —  m  a  a 

called  a  global  section  if  U  "M.  Global  sections  correspond  to  global 
trivlalizations.  ■ 

In  the  present  case  the  total  space  is  the  unit  sphere  in  <En, 
identified  with  S^n  *  Cl^n,  and  the  base  manifold  is  complex  projective 
space  (EPn  The  Lie  group  is  the  scalar  unitary  group 

U(l)  -  {ej9;  eeiG  (10.4) 


and  the  projection  map  is  that  taking  a  unit  vector  in  (E  to  its  equivalence 
class  via  the  relation  in  Definition  10.1. 


Condition  (A)  of  Definition  10.2  means,  for  the  present  purpose. 


that  for  each  element  g-e^9€:U(l)  there  exists  a  mapping  R  :  S^n  *  ■+•  S^n  * 

O 


defined  by  R  (z)  ■  ze^9,  zGS^n  and  that  this  mapping  is  nicely  behaved. 


Condition  (B)  means  that  each  fiber  of  the  bundle  looks  like  the  Lie  group 


,1 


U( 1)  and  thus  like  the  unit  circle  S  .  This  motivates  referring  to  the  set 

n 


of  unit  vectors  in  G  as  a  circle  bundle  over  complex  projective  space. 
Condition  (C)  may  be  interpreted  as  stating  that  locally  it  is  possible  to 
define  the  phase  of  a  vector  signal.  This  is  now  discussed. 

The  discussion  preceding  (10.3)  shows  how  one  might  place  each 
fiber  in  one-to-one  correspondence  with  the  interval  (—it , tt ]  and  thus  with 
the  Lie  group  U(l).  This  suggests  one  way  to  construct  local  trivializations 
The  definition  of  9(z)  in  (10.3)  can  obviously  be  extended  to  all  vectors 


not  orthogonal  to  the  reference  vector  z;  thus  the  neighborhood  U  in  (C) 


n-1 


can  be  extended  to  include  all  points  in  <EP  representing  subspaces  of 


(I  not  orthogonal  to  that  spanned  by  z.  The  corresponding  diffeomorphism 

T  :  it  *(U  )-HJ  *g  is  given  by 
z  z  z 


Tz(y)  -  ([y],ejez<y))  (10.5) 

with 

0  (y)  ■  *  zHy.  e  €(-*,*]. 

z  z 

j0l 

Note  that  0  (ye  )“0,  (y)+0,  as  required.  Again,  this  construction  shows 

that  it  is  possible  to  define  the  phase  of  unit  vectors  in  (En  locally  and 

that  the  domain  of  definition  can  be  extended  to  include  all  vectors  not 

orthogonal  to  a  given  reference  vector. 

From  Chapter  6  recall  that  6  (y),  defined  in  (10.5),  has  the  following 

Z  s  t 

physical  interpretation.  Given  two  sinusoidal  vector  signals  y(t)»ye  ° 
s  t 

and  z(t)  -z  e  ,  then  0z(y)  determines  how  the  signals  interfere  when 
added  together: 


!ly(t)  +  z (t)  II 2  -  e 


x  t 
o 


Jifi 7-  H 


Rez  y 


x  t 
o 


/2  J \  +  |  zHy  |  cos©  (y)  . 


(10.6) 


If  z  and  y  lie  in  orthogonal  subspaces,  then  no  interference  takes  place; 
thus  it  is  reasonable  that  this  definition  of  phase  cannot  be  extended  to 
a  global  definition. 

The  concept  of  a  local  section  (Definition  10.4)  can  be  viewed  as 
a  local  choice  of  a  reference,  or  phase  zero,  vector  in  each  fiber.  This 
can  be  seen  from  the  proof  of  Theorem  10.5;  the  reference  vector  in  the 
fiber  over  [y]  is  given  by 


P,([yl)  -  T“1([y] ,1) . 


(10.7) 


It  follows,  given  a  local  trivialization  and  the  local  section  defined  by 
(10.7),  that  a  unit  vector  w  may  be  decomposed  as 

w  ■  p  ([w])e^9z^  .  (10.8) 

z 

Suppose  that  a  global  trivialization  (Definition  10.6)  existed. 

This  would  imply  the  existence  of  a  definition  of  phase  valid  for  all  unit 
vectors  in  (En;  by  Theorem  10.5  this  would  also  imply  the  existence  of  a 
continuous  choice  of  reference  vector  in  each  direction  of  <Ln. 

Note  if  such  a  global  trivialization  existed,  then,  by  definition, 
there  would  exist  a  homeomorphsim  T  :S2n  ^xS^.  Now  it  is  a  fact  from 

topology  that  homeomorphisms  preserve  the  property  of  simple  connectedness. 

The  existence  of  the  map  T  would  therefore  imply  that  either  a)  both  S2n  1 
and  <EPn  are  simply  connected,  or  b)  neither  space  has  this  property.  It 

can  be  shown,  however,  [55,  p.  43]  that  the  unit  sphere  Sm  is  simply  connected 
if  and  only  if  m>  1.  Thus  the  unit  sphere  in  ®n,  n>  1,  is  simply  connected 
while  (and  thus  the  product  space  <CPn  ^xS"'")  is  not.  This  contradiction 
shows  that  no  map  T  with  the  desired  properties  exists.  Thus  it  is  not 
possible  to  obtain  either  a  method  of  defining  phase  of  vector  signals  which 
is  valid  globally  nor  (equivalently)  does  there  exist  a  continuous  choice  of 
phase-zero  reference  vectors. 

The  preceding  concepts  will  now  be  illustrated  by  using  the 

n  i  t 

standard  basis  vectors  for  C  ,  {e^  =  [00  ...  1  ...  0]  ;  i  =  l,...,n}  to 
define  local  trivializations ,  transition  functions,  and  local  sections. 


Consider  a  vector  p*  [p.,...,p  ]  SS  and  assume  that  e.p^O. 

in  I 

Then  p  lies  in  the  domain  of  definition  of  the  local  trivialization  defined 


where 


JMp) 

T^p)  -  ([p].e  ) 


9±(p)  -  *  e^p 


*  Pi* 


(10.9) 


(10.10) 


The  transition  functions  gik([p])  and  local  sections  p_^( [p] )  may 
be  computed  using  an  arbitrarily  chosen  vector  p  in  the  equivalence  class  [p]. 
The  transition  functions  are  given  by 


glk([pD  -  exp[j  *  pfcpi] 


(10.11) 


where  pfc  and  p  are  the  kth  and  ith  components  of  the  vector  p.  The  local 
sections  may  be  defined,  as  in  the  proof  of  Theorem  10.5,  by 


Pi((p])  ■  exp[-j  *  p  ]  -p. 


(10.12) 


Choosing  the  local  section  p^([p])  as  in  (10.12)  may  be  viewed  as  choosing 
for  a  reference  vector  in  each  subspace  that  vector  with  the  ith  component 
positive  real.  This  is  possible  provided  the  fiber  above  [p]  is  within  the 
domain  of  definition  of  T^;  i.e.,  provided  p^^O.  Note  that  the  local 
sections  are  related  by 


Pk((pl)  *  P1((p])gik((pl) 


(10.13) 


while  the  phase  coordinates  in  different  local  trivializations  are  related  by 


j  9± (P) 


gik([p])e 


jek(P) 


(10.14) 


Since  the  unit  sphere  in  <tn  may  be  identified  with  S^n  *CF^i 
it  follows  that  2n-l  real  coordinates  are  needed  to  describe  a  vector  in  (En. 
Suppose  a  vector  lies  within  the  domain  of  definition  of  the  local  trivial i- 
zation  T^.  Then  may  be  used  to  assign  a  coordinate  analogous  to  scalar 
phase  to  this  vector.  Thus  unit  vectors  not  orthogonal  to  e^  may  be 
specified  by  giving  the  appropriate  point  in  CPn  ^  plus  the  phase 
coordinate  corresponding  to  T^ 

A  method  for  assigning  the  2n-2  real  coordinates  needed  to  describe 
a  point  in  (EPn  *  is  now  developed.  Note  in  particular  that  this  assignment 
is  completely  independent  of  the  local  trivialization  used  to  distinguish 
vectors  in  the  fiber  above  the  point. 

2 

First,  consider  the  unit  sphere  in  <E  .  The  set  of  equivalence 

classes  under  the  relation  given  in  Definition  10.1  may  be  mapped  onto  the 
2  3 

unit  sphere  S  Q  3R  via  a  procedure  similar  to  one  given  by  Auslander  and 

MacKenzie  [64].  Coordinates  on  (EP*  may  then  be  identified  with  the  latitude 

2 

and  longitude  of  points  on  S  . 

T  2 

Indeed,  let  p  =  [p^  P2 ]  be  a  unit  vector  in  C  .  The  points  of  the 
projective  space  tP^  may  be  placed  in  one-to-one  correspondence  with  the 
extended  complex  plane  via  the  mapping 


Note  z(p)  is  a  function  only  of  the  subspace  spanned  by  p.  Moreover,  it 
is  not  essential  to  construct  the  correspondence  using  the  components  of  p 
written  in  the  standard  basis.  Any  other  basis  would  do;  this  fact  will  be 
used  later. 

By  mapping  the  extended  plane  onto  the  unit  sphere,  it  is  thus 
possible  to  place  the  points  of  <£P*  in  one-to-one  correspondence  with 
S^C  Defining  $  and  41  by 

A  P2 

tan  $  ”  —  ,  0<4>£ir/2 

P1 

and  (10.16) 

A  P2 

Y  -  *  —  -IT  <  ?  <  IT 

P1 


yields  a  set  of  coordinates  on  <EP  analogous  to  latitude  and  longitude 

T 

(Figure  10.1).  Note  that  the  subspace  spanned  by  e^*  [1  0]  maps  to  the 

T 

North  pole,  N,  and  that  spanned  by  *  [0  1]  maps  to  the  South  pole,  S. 

Thus  the  local  trivialization  is  defined  on  fibers  above  all  points  on 
the  sphere  excepting  the  South  pole.  An  analogous  statement  holds  for 
and  the  North  pole. 

Now  that  coordinates  have  been  assigned  to  the  base  manifold  ®P^, 
explicit  expressions  may  be  given  for  the  transition  functions  and  local 
sections  of  (10.11)  and  (10.12).  The  local  sections  defined  by  (10.12)  may  be 


(10.17b) 


Thus  a  given  unit  vector  in  I  whose  projection  has  its  coordinates  in 
2 

S  -  {N,S}  may  be  written,  as  in  (10.8),  as 


and 


P 


cos  4> 

If 

sin  <j>  eJ 


using  T^ 


P 


-If 

cos  4»  e  J 
sin  <j> 


using 


(10.18a) 


(10.18b) 


The  transition  functions  S^dpl)  “  are  given  by 


812^’^  "  e”^ 

(10.19a) 

and 

821(**'?)  "  e+^* 

(10.19b) 

j0l  j02 

Note  that  e  ■  g^2 (d» » e  as  required  by  Definition  10.3. 

Thus  e2  -  e  +y. 

3  2 

Note  that  the  expression  of  S  as  a  circle  bundle  over  S  is  the 

Hopf  fibration  [55,  p.  66].  This  construction  is  very  appealing  in  that  it 

2 

allows  one  to  visualize  the  unit  sphere  in  I  .  In  higher  dimensions, 
visualization  is  no  longer  possible;  a  mathematical  procedure  for  construc¬ 
ting  the  space  IPn  from  (EPn  *  is  given  in  [52,  pp.  114-115].  For  the 
purposes  of  this  thesis  it  is  sufficient  to  note  that  a  set  of  coordinates 
for  S2n  'cin  may  be  obtained  from  those  for  S2n  2  C  (En  *  via  the  following 
procedure.  Assume  that  p£ln  is  not  orthogonal  to  e^.  Thus  p  may  be 

written,  using  the  local  trivialization  T.  as  in  (10.8),  as 

JVp>  '  n-1 

p-p^([p])e  .  Then  coordinates  may  be  assigned  to  IP  using  the 

component  p^  to  define  cos  *  |p^|.  Thus  for  n*2 

j  8 

e 


cos  b, 


sin  b,  e 


(10.20) 


For  n  -  3 


For  n  ■  4 


cos  $ 


1 


3*, 


sin  cos  $2  e 


3** 


sin  sin  e 


38, 


COS  pj 

3*1 

sin 

cos  <t>2 

e 

3* 

sin  <j>^ 

sin 

cos  e 

3* 

sin 

sin  P2 

sin  e 

,  e  (-ir,n 

] ,  and 

0ie(-ir,ir] 

38, 


(10.21) 


(10.22) 


assignments  of  coordinates  to  (CP 

H 


n-1 


are  possible  (for  example  $  ^  could  be 


defined  from  cos  <(>^  ■  | [  *  iPjl)  and  the  phase  coordinate  defined  using 
may  still  be  chosen. 


From  (10.20)-(10.22)  one  can  easily  see  the  pattern  for  obtaining 
coordinates  for  IPn  from  those  for  (EPn  *.  Note  that  in  each  case  'F  repre¬ 
sents  the  phase  difference  between  the  (i+l)st  and  the  1st  components  of  p. 
Thus  'V ^  is  undefined  if  either  of  these  components  is  zero.  If  the  first 
component  of  p  is  zero,  then  of  course  the  local  trivialization  cannot 
be  used.  The  values  of  and  ¥  (defined  when  0  and  p^f*y)  may  still  be 
obtained,  however.  In  particular,  any  point  in  EPn  *  may  be  specified  by 
giving  a  set  of  coordinates  and  *?  .  Similarly  to  each  set  of  values 
of  p  and  4^  (noting  V  is  undefined  when  p^-0  or  it/2)  there  corresponds 
a  point  of  <£Pn  * . 


To  illustrate,  it  can  easily  be  seen  from  (10.21)  that  for  £  the 


local  trivializations  T^,  T2»  and  yield  phase  coordinates 

91  "  >  Pi 

(10.23a) 

e2  "  *  ?2 

“  91  +  *1 

(10.23b) 

03  "  >  ?3 

“  61  +  *2 

(10.23c) 

-  92  +  (4'2-4'1). 

Thus  the 

transition  functions  g^  »  (^.^.f^,*^)  are  given  by 

~ifl 

*12  *  e 

(10.24a) 

-^2 

gl3  “  e 

(10.24b) 

-j  (4'2-4'1) 

®23  *  6 

(10.24c) 

Finally, 

the  local  sections  p^((EPn  S  -*  S^n  *,  defined  in  (10.10), 

may  be 

expressed  using  the  above  coordinates.  For  (I  ,  the  local  sections 

A 

Pi  *  pi  (4*1 corresPondin8  t0  the  local  sections  T,,,  and  T3  are 


Pi 


cos 
sin  ip. 


cos 


(10.25a) 


sin 

cos  <t>2 

sin 

sin  <j>2 

-J*2 

cos 

e 

sin 

cos  <t>2 

sin  ^ 

sin  <j>2 

j<VV 


(10.25b) 


jcW 


(10.25c) 


The  above  construction  may  be  used  to  assign  a  complete  set  of 
(2n-l)  real  coordinates  to  each  member  of  a  pair  of  left  and  right  singular 
vectors.  Let  these  vectors  be  denoted  by  v  and  u,  respectively.  To 
illustrate,  assume  that  v  and  u  both  lie  in  the  domain  of  definition  of  T^ 
Then  v  and  u  may  be  represented  as 


u  -  oi([u])e  (10.26a) 

j9iv 

V  -  Pi([v])e  .  (10.26b) 

(Note  the  subscript  refers  to  the  numbering  of  the  local  trivalization  T. 


v  -  Pi([v])e 


defined  using  the  ith  standard  basis  vector  and  not  to  any  ordering  of  the 
singular  vectors.) 

Recall  (Appendix  B)  that  a  pair  of  singular  vectors  is  not  uniquely 

defined,  even  if  the  associated  singular  value  has  multiplicity  one.  If  (u,v) 

is  a  choice  of  singular  vectors,  then  (u,v)  *  e^a(u,v)  is  another  choice. 

Thus  the  phase  coordinates  of  the  vectors,  e.  and  3.  ,  are  not  well  defined; 

iu  iv 

however,  the  phase  difference  coordinate 


0.-9.  -6. 
i  iv  iu 


(10.27) 


is  well  defined. 


One  way  to  give  a  rule  specifying  the  singular  vectors  is  to 

choose  the  right  singular  vector  to  be  the  phase  zero  vector  in  a  local 

trivialization.  The  left  singular  vector  is  then  uniquely  determined  from 

the  equation  v-^-Mu.  Thus,  given  the  local  trivialization  T^ ,  u  may 

be  chosen  so  that  9,  i  0.  It  then  follows  that  9,  -9.,  as  defined  by 

iu  iv  i  3 

(10.27)  for  arbitrary  u  and  v.  Note  that  these  various  phase  coordinates 
may  be  related  via  the  transition  functions  g^*  From  (10.11)  and  (10.14), 
it  follows  that,  given  an  arbitrary  pair  (u,v) , 


j6iv  n  j0kv 

e  -  g..([v])e 


^9iu  ,r  ,s  ^9ku 

e  -  gik((u])e 


where 


J0i  _  j0k 

*>  e  gik([v])gik([u])e 


S ik ( C u 3 )  “  exp[j  *  0kui] 
gik C C v3 )  *  exp[j  *  v^] 


(10.28) 


(10.29  a) 


(10.29b) 


The  parameterization  of  singular  vectors  may  now  be  completed  by 
assigning  coordinates  to  IPn  For  example,  with  n  -  2  and  choosing  the 
right  singular  vector  to  have  phase  zero,  the  singular  vectors  may  be 
written  using  the  local  trivialization  T^  as 


cos 


sin  6.  e 
lu 


(10.30a) 


sin  <b,  e 
lv 


J,lv  S  * 


(10.30b) 


Similarly,  using  T„ 


cos  6,  e 
lu 

sin 


cos  e 
lv 


(10.31a) 


sin 


(10.31b) 


The  functions  in  (10.28)  relating  different  phase  difference 
coordinates  may  be  written  using  these  coordinates  on  CPn  1.  For 
example,  from  (10.23)  for  n=3,  it  follows  that 


9  ■  9  4*  V  —  y 

H2  °1  ' 1 v  Tlu 


93  =  91  +  f2v-T2u 


(10.32a) 


9 2  +  (-2v’'i'lv)  '  ^u’V  ‘ 


(10.32b) 


As  discussed  in  Chapter  6,  the  phase  difference  8^  is  a  measure 

S  t  St 

of  how  the  ith  components  of  the  signals  v(t)  *  ve  and  u(t)  * ue 
interfere  when  added  together.  It  was  also  pointed  out  that  the  most 
meaningful  definition  of  phase  difference  is  that  given  by  0  »  $  u  v,  since 
this  indicates  how  the  component  of  v(t)  in  the  direction  spanned  by  u  will 
interfere  with  u(t)  when  the  signals  are  added.  Local  trivializations  using 
the  right  singular  vectors  will  now  be  developed,  along  with  transition 
functions  relating  the  resulting  measures  of  phase  difference  to  the 
measures  9 . . 


Let  {ui  ;  i=l,...,n}  be  a  basis  of  right  singular  vectors.  Consider 


the  local  trivial izat ion  obtained  by  setting  z =  in  (10.5) 


jeu  (p) 

T  (p)  -  dpi  »e  i  ) 
i 


(10.33) 


where 

0  (P)  -  *  A.  (10.34) 

i  1 

Note  (p)  is  defined  for  all  p  such  that  u^pj^O.  The  components  of  the 
vector  p  written  in  the  basis  u^  are  given  by 


where 


U 


v 


(10.35) 


U  -  [uL 


Since  different  choices  of  the  right  singular  vectors  will  yield 
different  values  for  the  components  u^p  and  the  phase  coordinate  9^  (p) ,  it  is 
not  clear  these  quantities  have  physical  significance.  When  the  vector  p 
in  (10. 33)— (10.35)  is  the  ithleft  singular  vector  v^,  however,  the  coordinate 


<v 


H 

UiVi 


(10.36) 


is  independent  of  the  choice  of  the  {u^}.  (Although  the  components  ujcvi» 
k^i,  are  not  independent.)  This  fact  is  appealing  since  the  phase  coordinates 
defined  by  (10.36)  have  physical  significance. 


CoordinaCes  for  (IP  may  be  assigned  to  the  components  of  v^ 
written  as  in  (10.35).  In  particular,  it  is  useful  to  define 

cos  «>i  -  lujvj  .  (10.37) 

u 

Note  that  magnitudes  of  the  components  u^v^  are  well-defined  (assuming  all 

singular  values  are  distinct),  but  that  the  arguments  are  not.  Nonetheless, 

given  a  basis  of  right  singular  vectors  then  coordinates  based  upon  the 

H  H 

difference  between  *  u^v^  an<*  *  utvi  ^ 

To  avoid  notational  difficulties,  for  the  remainder  of  this  section, 
let  u  and  v  be  a  pair  of  singular  vectors  corresponding  to  the  same  singular 
value;  i.e.,  Au  «  av.  Thus  no  subscript  distinguishing  among  the  singular 
values  and  vectors  will  be  used.  Define  6  ■  6u(v)  as  in  (10.36).  Assume  that 
both  u  and  v  lie  within  the  domain  of  definition  of  the  trivialization  T^ 
given  by  (10.9)-(10.10) .  Note  that,  for  the  remainder  of  this  section,  the 
subscript  "i"  will  refer  to  a  choice  of  trivialization  T^.  Thus  one  may 
write  (repeating  (10.26)-C10.27)  for  convenience) 


u  -  0i([u])e 


j9 


iu 


(10.38  a) 


and 


v  -  Pi([v])e 


j0 


iv 


(10.38b) 


where  oi([u])  and  Pi([v])  are  the  local  sections  defined  by  (10.12)  and 


*  H 

9  -  *  e  u 

iu  i 


0  *  H 

9,  •  *  e.v 

iv  i 


e.  •  9,..  -  9,.. 


(10.39a) 

(10.39b) 

(10.39c) 


a 
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Two  different  methods  of  measuring  phase  difference  between  the  pair  of  left 
and  right  singular  vectors  v  and  u  have  now  been  defined.  The  first  method  is 
to  measure  9^,  the  phase  difference  between  the  ith  components  of  v  and  u,  as 
in  (10.39).  The  second  method  is  to  use  the  natural  measure  of  phase  dif¬ 
ference  9  ■  9q(v)  as  in  (10.36).  From  (10.36),  (10.38)  and  (10.39),  it  may 
be  seen  that  6  and  6^  are  related  by 

9  -  9u(v) 

x.  H 
*  a  u  v 

-  e±  +  *[PiUu])]H  Pi([v])  .  (10.40) 


Define  the  function  g^^  =  g^  ( [u]  ,  [v] )  by 

g±  »  exp(j  * [P^([u]) ]H  Pi([v])> 

so  that 


(10.41) 


(10.42) 


Thus  g.^  may  be  used  to  relate  the  different  measures  of  phase  difference 
between  pairs  of  singular  vectors.  Note  in  particular  that  is  a  function 

only  of  the  subspaces  in  which  the  singular  vectors  lie.  Expressed  in  the 

1  3 

coordinates  for  HP  ,  and  the  associated  coordinates  for  S  ,  defined  by  (10.18) 

the  functions  g^  and  g^  are  given  by 

51  "  81(W W 

*  exp{j*  [cos  <j>u  cos  +  sin  sin  1>v  ^  ]  ,  (10.43a) 

52  -  82(VVVV 

m  exp{j*  [cos  cos  4>  e  JV‘v  ir  +  sin  $  sin  $  ] }  .  (10.43b) 

u  V  u  V 


•  .■*  .*•  . 


Finally,  suppose  the  right  singular  vector  u^  is  chosen  to  be  that 
given  by  the  local  section  p ^ < [u] ) : 

u±  -  P^fu])  (10.  44) 

(i.e.,  by  setting  0  = 0  in  (10.26b)).  Then,  from  (10.27),  v  is  given  by 

v  -  Pi([v])e 

-1  j®i 
■  Pi([v])gi  g±e 

-  p([v])e^9  (10.45) 

where 

P(M)  -  Pi([v])g“l  .  (10.46) 


10.3.  Summary 

In  this  chapter  the  mathematical  structure  of  the  space  in  which 
singular  vectors  lie  has  been  studied.  It  has  been  shown  that  this  structure 
is  that  of  a  principal  fiber  bundle;  this  fact  in  turn  was  used  to  study 
the  existence  of  a  global  definition  of  phase  or  phase  difference  for 
singular  vectors. 

By  using  the  phase  or  phase  difference  coordinates,  as  well  as  sets 


of  coordinates  assigned  to  complex  projective  space  (the  base  space  of  the 


fiber  bundle),  a  complete  set  of  coordinates  for  pairs  of  singular  vectors 
was  obtained.  Rules  for  transforming  from  one  coordinate  system  to  another 
were  also  derived. 

An  important  advantage  to  describing  singular  vectors  using  the 
fiber  bundle  structure  is  that  the  concept  of  direction  of  a  signal  can  be 
formalized.  One  can  assign  coordinates  specifying  the  direction  of  a 
signal;  given  a  local  trivlalization,  a  phase  coordinate  may  then  be  used 
to  distinguish  among  the  unit  vectors  lying  in  the  same  direction. 


243 


CHAPTER  11 

MATHEMATICAL  STRUCTURE  OF  SINGULAR  VECTORS,  II 

11.1.  Introduction 

The  purpose  of  this  chapter  is  to  describe  the  geometric  structure 
of  the  differential  equations  governing  singular  vectors.  The  relation 
between  singular  values  and  various  measures  of  phase  difference  between 
singular  vectors  is  explored.  It  is  shown  that,  locally,  knowledge  of  the 
singular  values  and  singular  subspaces  suffices  to  determine  the  phase 
difference  to  within  a  constant.  The  fact  that  the  motion  of  the  singular 
subspaces  affects  the  phase  difference  is  a  multivariable  system  property 
which  has  been  suggested  by  the  example  in  Chapter  8.  The  generalized 
Cauchy- Riemann  equations  presented  in  this  chapter  describe  this  behavior 
mathematically.  Thus  the  groundwork  is  laid  for  generalizing  the  Bode 
gain-phase  relations  in  Chapter  12.  Finally,  a  systematic  method  for 
deriving  equations  constraining  transfer  function  matrices  is  developed. 

This  method  is  appealing  in  that  it  is  independent  of  coordinates ;  hence 
any  set  of  coordinates  may  be  chosen  and  the  relevant  equations  derived. 


11.2.  Differential  Equations  and  Connections  in  Fiber  Bundles 

As  in  Chapter  10,  identify  the  unit  sphere  in  Cn  with  the  unit 

sphere  S^n  ^  C  ]R^n.  Consider  the  principal  fiber  bundle  with  total  space 

2ii“l  n  1 

S  ,  Lie  group  U(l) ,  base  space  CP  and  the  projection  map  taking  each 

vector  w  to  its  equivalence  class  [w]  as  in  Definition  10.1.  In  this  chapter 

the  projection  map  in  Definition  10.2  will  be  written  as 


This  is  possible  since  there  exists  a  one-to-one  correspondence  between  the 

points  of  CPn  ^  and  the  projection  operators  onto  the  one-dimensional 

subspaces  of  dn  corresponding  to  these  points. 

Given  a  point  w6  S2n  \  consider  the  set  of  curves  w(t)  through  w  with 

the  property  that  w(0)  ■  w.  The  set  of  tangent  vectors  at  a  point  wSS^n  ^ 

is  denoted  TwS2n  ^  and  is  called  the  tangent  space  at  w.  Given  a  vector 
2n-l  H 

T  S  ,  the  differential  form  w  dw  evaluated  on  X  is  given  by 
w 

H,  H,  .  dw  H 

w  dw(X)  *  w  (t)  ~r~  -  w  X  , 

dt  t-0 

dw 

where  w(t)  is  any  curve  satisfying  w(0)  ■  w  and  —  *  X. 

2__i  dt  t-0 

The  tangent  space  to  w€  S  may  be  characterized  as 
T  S2n_1  -  {X€t  ]R2n|  Re[wHX]  -  0} 

In  Section  9.3  two  subspaces  of  TwS2n  ^  with  special  properties  were  singled 

out  for  study.  These  subspaces  will  now  be  characterized. 

First,  let  w(t)  be  a  curve  in  S2n  ^ .  Then  w(t)  will  be  called 

a  "minimum  energy"  curve  if  the  condition  wH  7^  *  0  is  satisfied  for  all  t. 

at 

The  set  of  all  tangent  vectors  at  w  with  this  property  will  be  denoted  by 

H  =  {XST  S2n_1|wHX  -  0}  .  (11. 

w  w  1 

Note  it  follows  from  the  definition  of  H  that,  for  all  g€u(l),  the 

w 

subspace  H  *  (Hw)g. 

A  second  set  of  curves  discussed  in  Section  9.3  were  those  of  the 
form  w(t)  *  we^a^C\  with  w  a  constant  unit  vector.  Tangent  vectors  to 


curves  with  this  property  are  given  by  that  subspace  of  T  S  defined 

w 

by 

V  -  (X  £  I  S 2n_1 1  -  X}  .  (11.3) 

w  w 

Note  that  the  circle  of  unit  vectors  in  ®n  lying  in  the  subspace  spanned 
by  w  may  be  identified  with  a  great  circle  in  S2n_1.  The  characterization 
(11.3)  of  Vw  states  that  the  tangent  vector  to  w(t)  at  w  must  be  tangent 
to  this  great  circle. 

For  later  reference,  note  that  V  has  the  alternate  characterization 

w 


V  " 0} 

To  show  this,  first  note  that  if  w(t)  is  a  curve  with  w(0)  *  w  and 

dt 

then, from  (11.1), 


(11.3)’ 

-  X 


TT*(X) 


£  *<w<t)) 


[w(t)wH(t)] 


dw 

dt 


w  (0)  +  w(0) 


H 


„  H  .  VH 
=  Xw  +  wX 

The  characterizations  (11.3)  and  (11.3)’  are  then  equivalent  provided 

ww^X  *  X 


XwH  +  wX1*  -  0 


To  show  this,  note  that 


XwH  +  wXH 


■  ww**(Xw**  +  wX^)  +  (I  -  ww**)  (Xw**  +  wX**) 

H  H  JI  H  H  H  H  H 

*  ww  Xw  +  wx  +  Xw  -  wX  -  ww  Xw  +  wX 

*  w(-  w**Xw**  +  X**)  +  (ww**X  +  X)w** 

=  w^w11  +  Xs)  +  (wwHx  +  x)wH 

H  u  h 

where  the  fact  that  ww  =  l**Xw+wX“0  was  used.  It  follows  from 
(*)  that  ww**X  +  X  ■  0  *•'  Xw**  +  wX**  ■  0.  Moreover  (*)  also  implies  that 


(*) 


(Xw**  +  wX**)w  -  w(X**ww**  +  X**)w  +  (ww**X  +  X) 
»  wX**w  +  wX**w  +  ww**X  +  X 

,**X  -  ww**X  +  ww**X  +  X 


-  ww 


=  X  -  ww**X 

Hence  XwH  +  wXH  =  0  »  X  -  ww**X. 

From  (11.2)  and  (11.3)  it  follows  that  the  tangent  space  may  be 
decomposed  as  the  direct  sum 


T  S2n_1  *  H  ®  V  (11. 

w  w  w 

Again,  note  the  subspace  consists  of  tangent  vectors  to  that  great  circle 
in  S2n  *"  identified  with  the  circle  of  unit  vectors  lying  in  the  subspace 
of  Cn  spanned  by  w.  The  subspace  Hw  consists  of  vectors  normal  to  this 
great  circle. 


Nov  consider  the  decomposition  (11*4)  applied  to  each  member  of  a 
pair  of  singular  vectors  corresponding  to  the  same  singular  value.  Recall 
from  Chapter  9  that,  if  the  singular  subspaces  were  constant,  then  the 
logarithm  of  the  singular  value  would  be  a  harmonic  function.  Hence  locally 
the  properties  of  the  singular  value  would  correspond  completely  to  those  of 
the  gain  of  a  scalar  transfer  function.  From  (11.4)  the  condition  that  the 
singular  subspaces  are  constant  is  equivalent  to  the  condition  that  the 
derivatives  of  the  singular  vectors  v  and  u  lie  entirely  in  the  subspaces 
Vv  and  Vu>  respectively.  Thus  properties  of  the  singular  value  deviating 
from  those  of  the  gain  of  a  scalar  transfer  function  must  be  due  to  the 
components  of  the  derivatives  of  v  and  u  lying  in  the  subspaces  Hv  and 
respectively.  In  particular,  the  failure  of  log  o  to  have  a  harmonic 
conjugate  implies  that  the  measures  of  phase  difference  defined  in  Chapter  10 
cannot  be  related  to  the  singular  value  in  the  same  way  that  phase  and  gain 
of  a  scalar  transfer  function  are  related.  In  order  to  get  a  better  picture 
of  the  way  in  which  these  quantities  are  related,  it  will  be  convenient  to 
introduce  the  notion  of  a  connection  in  a  fiber  bundle.  It  will  first  be 
shown  that  the  decomposition  (11.4)  satisfies  the  definition  of  a  connection. 
Then  an  equivalent  form  of  the  connection  will  be  introduced  which  is  much 
more  convenient  for  doing  calculations  and  gaining  insight.  This  form  of 
writing  a  connection  will  eventually  allow  a  generalization  of  the  Bode 
gain-phase  relations  to  be  obtained  in  Chapter  12. 


The  preceding  discussion  may  be  used  to  show  that  the  decomposition 
(11.4)  satisfies  the  following  definition.  Let  it  :  P  +  M  be  a  principal  fiber 
bundle  with  group  G  (in  this  thesis  always  U(l))  and  base  space 
M  of  real  dimension  n. 

Definition  11.1  [54.  P.  29 1 :  A  connection  assigns  to  each  p€P  a  subspace 

Hp  C  TpP  such  that  for  *  (XS  T^P } -rr^ (X)  *  0}  the  decomposition  TpP  *  Hp®  Vp 

holds.  It  is  required  that  R  .(H  )  «'H  and  that  there  exist  n  vector 

g*  P  Pg 

fields  (defined  on  a  neighborhood  U  of  p)  that  span  V  q€U.  The  subspace 

V  is  termed  the  vertical  subspace  of  T  P  and  H  is  termed  the  horizontal 
P  - c -  P  P  - 

subspace.  ■ 


In  the  present  case  the  map  R  :  P  +  P  is  given  by  R  (w)  *  weJ0t, 

g  g 


for  g  =*  e^a.  Thus  R  (X)  *  Xe^a  and,  from  the  definition  (11.2)  of  H  , 

©★  O 


it  follows  that  R  (H  )  *  H  *  H  .  Thus  the  decomposition  (11.4)  is 
g*  w  w  wg 


indeed  a  connection. 


tangent  space  to  S 
2n-l 


Definition  11.1  describes  a  connection  as  a  decomposition  of  the 
2n-l 


operator  to  :  TwS 


From  (11.2)  and  (11.3)  it  may  be  seen  that  the  linear 
{ja|a£]R}  defined  by 


w  (X)  *  wHdw(X)  -  wHX 
w 


(11.5) 


decomposes  TwS2n  ^  into  the  null  space  of  w,  (i.e.,  Hw)  and  its  orthogonal 
complement,  (i.e.,  V  ) . 

In  [54,  pp.  29-30]  a  definition  of  a  connection  equivalent  to 
Definition  11.1  is  given;  this  definition  focuses  on  the  linear  operator 
(11.5)  rather  than  T  S2n_*.  For  the  purposes  of  this  thesis  it  suffices  to 
note  that  to  defined  by  (.11 .5)  is  a  connection  one-form  defined  on  the  total 
space  P.  It  will  prove  useful  to  have  yet  another  equivalent  definition  of  a 


connection  as  defined  on  the  base  space  M  rather  than  the  total  space. 
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First,  some  notation  will  be  introduced.  If  G  is  a  Lie  group,  then 
denote  the  Lie  algebra  [54,  p.  18]  of  G  by  G.  In  the  present  case 
G  -  U(l)  *  {eJ<5(6€®}  and  G  -  U(l)  -  {j5j6€]R}. 

Definition  11.2  [54,  pp.  30-31];  A  connection  assigns  to  each  local  trivial- 

ization  T  :  ir_^(U  )-+-U  *G  a  G-valued  one-form  to  on  U  .  If  T„  is  another  local 

a  a  a  a  a  d 

trivialization  and  g  0:U  n  U0 -►  G  is  the  transition  function  from  T  to  T«, 

Cl  P  01  P  CX  P 

then  it  is  required  that 


ga8  dga6 


+  gaguagag 


(11.6) 


From  the  proof  in  [54,  p.  32]  it  follows  that  the  connection 
forms  to  in  Definition  11.2  may  be  obtained  from  the  connection  one-form 
to  using  local  sections  corresponding  to  local  trivializations  as  in 
the  proof  of  Theorem  10.5.  Consider  a  point  x€  CPn  1  with  x  in  U^,  the 
domain  of  definition  of  the  local  trivialization  T  ,  and  let  Y  S  T  <EPn  d. 

1  XX 

Then  ui  can  be  defined  by 

a 


Woa(x) 

-  D;jHtx)dPa(Yi!) 


or,  dropping  the  dependence  on  x  and  Y  , 


H, 

"a  =  °a  dps 


(11.7) 


where  Pa  is  the  local  section  CIO. 7)  defined  using  the  local  trivialization 

T  .  Recall  that  different  local  sections  are  related  via  the  transition 

a 

functions  as 

pg(x)  *  pa(x)ga6(x)  .  (11.8) 

Thus  it  follows  that 


dp 


B 


dpagct8  +  padgaS 


H. 

ps  dps 


pBHdpa8a8  +  p°Hp~dg" 


'B  ct  sag 


H. 


gaSPu  dpctgaB  +  ga8dgaS 


(11.9) 


Since  in  the  present  case  8ageU(l),it  follows  that  gag 


g“g*,  hence. 


(11.9)  reduces  to 


H.  -1J  .  -1  H, 

PS  dpS  =  gaSdga3  +  gaSpa  dpagaS 


-1 ,  .  H, 

gaSdgaB  +  Pa  dpa 


(11.10) 


Thus  condition  (11.6)  is  satisfied. 

Before  applying  the  various  definitions  of  a  connection  to  the 
study  of  singular  vectors  one, further  definition  will  be  helpful.  The 
following  paragraph  is  based  on  [54,  p.  37]. 

Given  a  connection  one-form  oj  on  the  principal  fiber  bundle 


”■  :  P  ■*  M  with  group  G,  any  vector  XGT^P  may  be  written  as  X  *  XV  +  X^,  where 


V  H 

X  €  v,  the  vertical  subspace,  and  X  €  H,  the  horizontal  subspace. 


Lemma  11.3  [54,  p.  37]:  Given  a  vector  field  X  on  M,  there  exists  a  unique 


vector  field  X  on  P  such  that  w(X)  =  0  and  r.(X  )  *  X  ,  N  for  all  p€p.  The 

*  p  t  (,p; 


vector  field  X  is  called  the  horizontal  lift  of  X. 
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To  apply  the  preceding  definitions,  let  w  be  a  unit  vector  in  C  , 

U 

and  assume  that  w  t  0,  so  that  the  local  trivialization  (10.9)  may  be 
applied.  Then  w  may  be  written  as 


w  =  p^e 


(11.11) 


where  6i  =>  0i(w)  is  the  phase  coordinate  given  by  (10.10)  and  pi  =  p  ([w]) 
is  the  reference  vector  given  by  the  local  section  (10.12). 

Let  a  :  (-e,e)  -*•  S^n  ^  be  a  curve  with  a(0)  =  w  and  let  X  be  the 
tangent  vector  to  a  at  t=0 : 


X  "  aCt) 


(11.12) 


t=0 


Then  writing  a(t)  =  oi(t)e^  9±  ^  ,  where  o^t)  =  o ± ( [w(t)  ] )  and 


0 i ( c )  =  5jL(w(t)),  gives 


X 


dt 


ifl.  ddi 

+  Die  1  «  ’dT) 


(11.13) 


t=0 


The  condition  used  to  define  the  horizontal  subspace  (11.2)  may  then  be 
expressed  as 

H  ,  .  H  doi  d0i 
w  dw(X)  -  P.  -jj-  +  j-fir 

=  0  .  (11.14) 

Thus  the  connection  one-form  can  be  written  as 


H  H 

w  dw  =  (p.  dc.)oTT  +  j  dS . 


l  l 


(11.15) 


It  follows  that  tbe  horizontal  subspace  may  be  characterized  ir.  terms  of 
the  local  trivialization  T^  as 

.  .  „  do  d9 . 

H  =  {xe  T  S  1|p.  —  +  j-~  -  0} 

w  w  1  i  dt  dt 


(11.16) 
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Recall  the  differential  equations  developed  in  Section  9.3  to 
describe  the  motion  of  the  singular  vectors.  These  differential  equations 
will  now  be  examined  using  the  concept  of  a  connection. 

Let  y(t)  *  x(t)  +  jy(t),  [0,1]  be  an  analytic  function  of  t. 
Assume  the  image  of  y  lies  in  an  open  connected  subset  of  the  complex  plane 
over  which  the  singular  values  of  M(s)  are  distinct.  By  this  assumption,  at 
each  point  of  y(t)  there  exist  n  uniquely  defined  right  singular  subspaces  of 
complex  dimension  one.  Let  the  singular  values  of  M  be  given  the  usual 
ordering  and  denote  the  right  singular  subspace  corresponding 

to  ai(t)  by  U^t)  -  ll^yCt)). 


In  Section  9.3  the  problem  of  determining  a  (nonunique)  right 
singular  vector  given,  a  well-defined  right  singular  subspace,  was  posed. 

In  the  terminology  introduced  in  Chapter  10  it  can  be  said  that  the  min-max 


property  (6.15)  determines  curves  U^(t)  in  the  base  space  (CP 


n-1 


and  the 


problem  consists  of  how  to  "lift"  these  curves  to  the  total  space 
2x1  X 

S  .  Motivated  by  the  terminology  of  Lemma  11.3,  the  choice  of  right 

singular  vector  obtained  by  fixing  an  initial,  condition  u.(0),  setting 
H  dui 

u^  anc*  inteSrating  the  differential  equation 

du. 


dt 

du 

H  i 

\  7T 


z  uk 
k*i  R 


2  2 

■w 


(■■■£) 


,  H  dM11  .  H  dM  , 
{°iUk  "dt~  Vi  +  °kVk  dFUi} 


along  y(t)  will  be  termed  the  horizontal  lift  of  Ui(t)etPn  d  to  S^n  d  with 
initial  condition  0^(0).  Note  the  right  singular  vector  obtained  in  this 
fashion  is  a  "minimum  energy"  singular  vector  discussed  in  Section  9.3;  as  in 
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r.m 
'  . 


that  section,  denote  this  vector  by  u*(t).  Observe  there  exists  a  family  of 
"minimum  energy"  singular  vectors  above  ll^(t),  indexed  by  the  initial 
condition  u^(0). 

Suppose  now  that  y(t)  is  a  closed  curve,  y(l)  ■  y(0).  Recall  from 

*H  du* 

Section  9.3  that  the  condition  u  =  0  was  interpreted  as  meaning  that 

u*  does  not  "spin"  as  it  moves.  (Until  further  notice,  the  subscript  "i" 

will  be  omitted.)  It  was  pointed  out  that  if  the  singular  subspace  U(t)  is 

constant,  then  it  necessarily  follows  that  u*(t)  not  spinning  implies  that 

u  (1)  *  u  (0).  If  ll(t)  is  not  constant,  however,  then  no  results  were  given. 

It  can  now  be  shown,  using  characterization  (11.16)  of  the  horizontal  subspace, 

that  in  general  u  (1)  /  u  (0).  To  see  this,  assume  for  simplicity  that 
H 

e^  u*(t)^0  Vt^  [0,1].  Then  the  local  trivialization  T^  may  be  used  to 
characterize  Hu*.  Equation  (11.14)  implies 

u*(l)  -  ejS  u*(0) 


where 


je 


-!(•■*)* 

\  (  H  3o  dx  H  3c  dy\ 

"  l[°  ^  dF  +  p  d tj 


dt 


I r  H  3o  ,  H  3o  , 

-  /  p  —  dx  +  o  —  dv 

Jy  3x  3y  ' 


(11.17a) 


(11.17b) 


(11.17c) 


and  d  *  Pj.  is  the  local  section  defined  in  (10.12). 

From  (11.17a)  it  follows  that  the  phase  discrepancy  8  is  a  function 
solely  of  the  path  traced  out  in  IPn  ^  by  the  subspace  LI ( t )  =  ll(y(t)). 


-4 


t  A  i  ’L  i 


f  >,r,  «:,  ».i)i 


* ..■**  • 


Thus  (11.17)  may  be  rewritten 


j S  -  -  /  PH  dp 

uct) 


(11.18) 


Note  the  integrand  of  (11.18)  is  the  connection  form  described  in  Definition 

11.2.  For  later  reference,  expressions  for  this  form  in  terms  of  the 
1  2 

coordinates  on  CP  and  CP  given  in  Chapter  10  are  now  derived.  For  n*2, 
it  follows  from  (10.17a)  that  coordinates  in  U  are  given  by  (<p,p)  and 


P 


COS0 

1  ^ 

sin eJ 


(11.19) 


dp 


-sin<)> 


|  cos$  e 


d<p  + 


(_sin<J>  e 


jd^ 


pHdp  ■  j  sin^<(>  dip 


(11.20) 

(11.21) 


(Note  it  is  still  assumed  that  local  trivialization  is  being  applied.) 
For  n=3,  it  follows  from  (10.25a)  that  coordinates  on  11  are  given  by 


p 


H, 

p  dp 


cos<^ 

sinb^  cosd^ 
sin<*>^  sinbj  e^2 

2  2  2 
j  sin  <fr^(cos  ^dij^  +  sin  <+>2<i^'2 ^ 


(11.22) 


(11.23) 


Elsewhere  in  this  thesis  interpretations  of  the  parameters  and 


■p,  were  presented.  In  examples  it  may  be  possible  to  identify  these  parameters 
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I 


r. 

i  . 


¥’ 

I*. 


with  elements  of  the  transfer  function  matrix  written  in  standard  coordinates. 
In  such  cases  the  expression  for  the  phase  discrepancy  in  terms  of  the  ^  and 
may  prove  useful.  This  will  become  more  clear  later. 

In  order  to  apply  Stokes'  Theorem  [49,  p.  124]  to  Equation  (11.17), 
assume  that  y(t)  is  a  simple  closed  curve  analytic  in  t.  Then  the  set 
C  *  (y(t)  ;  t€  [0,1]}  is  the  boundary  of  a  compact  subset  DC®.  Since  it  was 
assumed  that  the  singular  values  of  M(s)  were  distinct  over  D,  it  follows 
that  at  each  point  s  ■  x  +  jy£D  the  right  singular  subspaces  are  well-defined. 
Assume  that  e^u*(x,y)  j4  0  for  (x,y)  €  D.  Thus  the  local  trivialization  T^ 
may  be  used  to  characterize  Hulk  over  D.  Write 

*  j0 
u  =*  peJ 


From  (11.17c)  and  the  special  case  of  Stokes'  Theorem  known  as  Green's 
Theorem  [49,  p.  134] 


jS 


r  H  3p  ,  ,  H  3o  , 

-/  p  at  **  +  p  ^  dy 

w 


r  , .  H  9p  ,  H  9p  .  , 
-/  d[p  ^  dx  +  p  ^  dy] 

h  a2p 


9x9y^  + 


r  I*  f  3pH  iP 

i  L'  W  3x 

.  f  ( le.  _  1°  ^ 

3  \  3x  9y  9y  9x  J 

2J  (/  Ib  [%■ !?] dxdy 


3p^  3o  H  32oY|  . 

.IT? 7+°  37^jJdxAdy 


dxAdy 


(11.24) 


where  the  fact  that  mixed  second  partial  derivatives  are  equal  (Appendix  G) 
was  used.  Using  (11.21)  it  may  be  shown  that 


a  r  r  .  2  /96  3iL»  9©  9t]A 

S  -  //  sin  *  TT  37j  dxdy 


(11.25) 


£ 


v'O 

►  « 

,  - . 


■1 


•A  0.  0 


'■  *  o  '  ■ 
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Formulas  similar  to  (11.25)  hold  for  n  >  2  as  well.  An  alternate 
method  of  computing  6  which  considers  only  the  set  U(D)  C  £Pn  ^  will  now  be 
developed.  The  cases  n»2  and  n>  2  must  be  considered  separately. 


First  consider  the  case  n=2  and  the  mapping  11  :  D-*-4P^.  Denote  the 
image  of  C  under  the  map  11  :  C-*-CP^"  by  T  and  assume  that  T  is  a  simple  closed 
curve  in  the  domain  of  p.  Since  4P^  is  a  compact  connected  manifold  of  real 
dimension  two  (52,  p.  115],  it  follows  that  T  separates  4P1  into  two  subsets; 
moreover,  T  is  the  boundary  of  each  of  these  subsets.  It  is  a  fact  that  4Pn  is 
orientable  [52,  p.  119];  given  an  orientation  of  Cp\  it  follows  that  each  of 
the  two  subsets  has  an  orientation  inherited  from  that  for  CP^.  Each  of 
these  in  turn  induces  an  orientation  of  F  and  these  induced  orientations  are 
opposite  in  direction.  Choose  the  subset  of  4P1  yielding  the  induced 
orientation  of  T  which  agrees  with  the  direction  travelled  along  F  by  ll(t) . 
Denote  this  subset  by  A.  Stokes  Theorem  [49,  p.  124]  then  shows  that 


is 


jB 


r  H, 

-  )  D  dp 

r 


-  /  d(pHdp) 
A 


-  /  (dp Ado 
A 


(11.26) 


Using  (11.21) 


.u 


8  -  -  /  sin  <j>  d\|i 

r 


-  /  sin2<S  doAdt|) 
A 


(11.27)  - 


Note  that  3  may  be  computed  with  knowledge  only  of  F  and  its 


orientation.  Further  knowledge  of  the  mapping  U  :  D-*-CP  is  unnecessary. 


One  further  interpretation  of  (11.27)  is  available.  Recall  that 

1  2  3  2 

<£P  is  homeomorphic  to  S  £  R  .  Coordinates  for  S  are  given  by 

x^  ■  cos2$  [0,ir/2] 

x2  -  sin2$  cosi^  (-it.tt]  (11.28) 

x^  ■  sin24>  simp 

2 

From  Spivak  [49,  p.  131]  it  follows  that  the  element  of  surface  area  on  S 
is 

n  »  x1dx2Adx3  +  x2dx;jAdx1  +  x^dx^Adx^  (11.29) 

or 

*  2sin24>d<j>Adij>  .  (11.30) 

2 

Let  T  denote  the  curve  in  S  mapped  out  by  the  image  of 
s 

T£ CP1,  and  let  A  denote  the  interior  of  r  .  Then  from  (11.17a) 
s  s 

dt 

=  -j  /  sin2<f>  ^  dt 

*  -j  /  sin 

r 

s 

■  -j  /  sin 2$  d<pAd'4i 
A 

3 

1  2 

*  -  ^[area  of  A  C  s  ] 

2  s  — 

Thus  the  phase  discrepancy  3  is  proportional  to  the  area  of  the  subset  of 
2 

S  enclosed  by  I*  .  Hence  if  the  change  in  the  subspace  U  is  small,  the  area 


enclosed  will  be  small,  and  so  will  the  phase  discrepancy. 


In  the  general  case  n  >  2,  the  mapping  from  D  £  C  into  *P  is 
a  mapping  from  a  space  with  real  dimension  2  into  a  space  with  real 
dimension  2n-2  (>2  for  n>2) .  Thus  T  *  11(C)  is  not  the  boundary  of  any 
subset  of  CPn  \  However,  under  certain  conditions  [49,  pp.  109-114],  the 
image  A'  »  U(D)  is  a  submanifold  of  CPn  ^  with  boundary  C.  (Methods  for 
verifying  these  conditions  hold  are  available;  however,  this  will  not  be 
pursued  in  this  thesis.  Rather  they  will  be  assumed  to  hold  in  order  to 
illustrate  the  type  of  results  available.) 

Stokes  Theorem  may  be  applied  to  show 


-  /  P  dp 

r 

-  /  d(pHdp) 

A’ 


-  /  (dp)  Adp 

A' 


(11.31) 


(Note  the  definition  of  A'  in  (11.31)  is  different  from  that  of  A  in  (11.26)). 
For  n*3,  (11.23)  shows  that  the  integrand  of  (11.31)  is 

H  2  2 

(dp)  Adp  ■  j  [sin2<^(cos  ^d^Adil^  +  sin  ^d^Ad^) 


+  sin  ^si^^C-d^Ad’^  +  d4>2Adip2)  ] 


(11.32) 


Again,  it  follows  that  S  is  a  function  solely  of  the  image  of  f 


in  ®P  ,  not  of  the  original  curve  y(t)  bounding  DC®.  Note, 
moreover,  that  since  to  ■  o  dp  is  a  connection  with  values 


in  a  scalar  Lie  algebra,  it  follows  [54,  pp.  37-39]  that  da)  ■  (dp)  Adp  is  the 
curvature  of  w.  Thus  in  all  cases  n>2  it  follows  that  6  is  obtained  by 
integrating  the  curvature  of  a  connection  over  an  appropriate  surface. 


The  preceding  observations  may  now  be  used  to  interpret  the  meaning 
of  the  differential  form 


Slogo  Slogo 

Ai  - 1 1-  ->r d* + 


3x 


dy] 


..HI  .  .  .  1  T  r  H  3M  ,  , 
lBtTl  uiJd*  +  In[vi  37  “t)dy 

.  H  3V1  H  5ui,  .  H  5V1  H  3ui,, 

Ivi  nr  -  Ui  ir!dx  +  [vi  17  -  °i  i7,dy 


(11.33) 


It  will  be  seen  (Equations  (11.47)  and  (11.483  that  governs  how  the  measures 
of  phase  difference  9^  and  9^  change  as  a  function  both  of  the  singular  values 
and  the  singular  subspaces. 

First,  note  that  the  Laplacian  of  log  may  be  obtained  from 


/32log a.  3‘logo.  \ 


32loga . 


dxAdy 


(11.34) 


It  was  shown  in  Section  9.4  that  (11.34)  does  not  in  general  equal  zero; 
moreover,  the  deviation  from  zero  is  related  to  the  motion  of  the  singular 
vectors.  This  may  also  be  seen  from 


3v  H  3v  3u.H  3u 

7  log*!  ■  2{Imf-drir]  -  ir]} 


(11.35) 


(From  now  on,  the  subscript  "i"  will  be  suppressed  when  convenient.)  Assume 

H  H 

for  simplicity  that  both  e^  u  0  and  e k  v  j4  0.  Then  u  and  v  may  be 
written  in  terms  of  the  phase  coordinates  and  local  sections  given  by  the 
local  trivialization  T^  defined  using  the  standard  basis  vector  e^  as  in 
(10. 38)-(10. 39) : 


'.••I 
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u  -  p,  e 
ku 


J0ku 


(11.36) 


v  -  p,  eJ  kv  , 

kv 

where  pku  -  Pk<[u(s)])  and  pfcv  -  Pk([v(s)]).  (Note  that  pfcu  is  a  local 
section,  not  of  the  original  bundle  over  tPn  but  of  its  pullback  along 
the  map  [u]  :  D-^®Pn  \  A  similar  comment  holds  for  Pkv«)  If  u  is  chosen 
so  that  =  0,  then  (11.36)  becomes 


with 


u  -  p. 


je, 

v  *  p,  e  k 
kv 


9,  =  0.  „  -  0. 
k  kv  ku 


The  form  A  may  be  rewritten  as 


(11.37) 


A.j(.il£*°dx  +  U|S£dyl 

r  H  kv  H  ku.  .  k,  , 

■  ‘“kv  IT  *  °ku  IT  +  3  al  ldx 

.  .  H  5°kv  H  3cku.  ,  3\,  . 

+  ‘°kv  17  -  “ku  If  +  3  37 


di.38) 


From  (10.45)  it  follows  that  v  may  be  rewritten  in  terms  of  the 


phase  difference  J  «  *u  v: 


v  *  o  e~ 
v 


(11.39) 


where  o  ■  o([v(s)]).  Using  the  function  g.  (10.41)  it  follows  from  (10.42) 

V  K 


that 


(11.40) 


I 


iS-J 


and  from  (10.46)  that 

H  9P 


3x 


R.  .  —4. 

+  8,, 


3x 


_ v 

3x 


H  3pkv 


Jkv  3x  '  8k 


-1  3*k 


3x 


(11.41) 


Using  (11.40)  and  (11.41)  and  similar  equations  for  derivatives  with  respect 
to  y,  it  follows  that  (11.38)  equals 

dy) 


[P. 


H  3pv 


H  3pkUj.  ,  30,  . 
3x  Pku  3x  ^  3x 


u  3o 
H  v 


r  ..  *  H  3pku  .  ,  36 1  , 

[pv  3y  “  pku  3y  +  ^  3y  dy 


(11.42) 


Rearranging  the  form  (11.38)  yields 

-  i  (^3^  +  3^)dlt  +  1  -  8^)dy 

,  H  3°kv  H  3pku.  r  H  5pkv  H  3°ku, 

Cp’-  “  ’  p’--  ~  ldx+  [pkv  ~3y  "  pku  ~3y  ]dy 


kv  3x  ku  3x 
and  rearranging  the  form  (11.42)  yields 


-  [pvH  -  p,..H  ^-u]dx  +  [p..H  -  p.H 


ku  3x 


v  3y 


ku  3y 


]dy 


(11.43) 


(11.44) 


10  i  0 

Suppose  ceJ  k  (respectively,  aeJ  )  were  analytic.  Then  log  o  and 
“k  (.respectively,  log  a  and  9)  would  necessarily  satisfy  the  Cauchy- Riemann 
equations.  This  is  possible,  however,  only  if  the  right  hand  sides  of  (11.43) 

(respectively,  (11.44))  are  equal  to  zero.  These  equations  quantify  the  extent 

i  0  10 

by  which  motion  of  the  singular  subspaces  prevents  aeJ  k  and  aeJ  J  from 
satisfying  the  local  gain-phase  relations  given  by  the  Cauchy-Riemann  equations, 


V  V\-VV„"-W. 

. Vl  V  VV. ,  R.  1  V,V" .V.'vVwV-' 


Indeed,  if  y(t)  is  a  curve  in  a,  then  the  left  and  right  singular  subspaces 

<*  1 

map  ytt)  to  curves  9(t)  and  ll(t)  in  (£P  . 

Recall  now  that  a  left  singular  vector  v(t) £  S^n  *  is  said  to  be 
"minimum  energy"  if  v^1  -^  =  0.  Given  a  curve  9(t)£®Pn  ^  determined  by  a 
left  singular  subspace,  a  minimum  energy  left  singular  vector  above  9(t)  will 

JL  JL 

be  denoted  by  v  (t) .  From  Lemma  11.3  it  follows  that  requiring  v  (t)  to  be 
minimum  energy  above  V(t)  is  equivalent  to  requiring  that  the  vector  field 
tangent  to  the  curve  v*(t) £  S^n  ^  be  the  horizontal  lift  of  the  vector  field 
tangent  to  the  curve  JJ(t)€CPn  Now,  from  the  characterization  (11.16)  of 
the  horizontal  subspace,  it  follows  that  the  rate  of  change  of  the  phase 
coordinate  assigned  to  v*(t)  by  some  choice  of  local  trivialization  is 
determined  by  the  rate  of  change  of  the  corresponding  local  section  o  with 
respect  to  a  minimum  energy  vector.  This  latter  rate  of  change  is  given  by 
the  difference  between  the  tangent  vector  to  p (t) S  S^n  ^  and  the  horizontal 
lift  of  the  tangent  vector  to  9(t)€tPn  Moreover,  the  value  of  this  rate 

u 

of  change  may  be  found  by  evaluating  the  connection  form  p  dp  on  the  tangent 
vector  to  0(t) .  A  similar  discussion  can  be  made  concerning  u*(t) ,  a 
minimum  energy  singular  vector  above  the  right  singular  subspace  U(t) . 

Now  consider  the  forms  (11.43)  and  (11.44),  which  show  how  the 
functions  aeJ  k  and  aeJ  fail  to  satisfy  the  Cauchy-Riemann  equations. 
Evaluating  these  forms  along  a  tangent  vector  to  a  curve  y(t)£|  shows  that 
this  failure  is  quantified  by  the  difference  between  the  deviations  of  the 
local  sections  above  9(t)  and  U(t)  from  minimum  energy  vectors  above  these 
curves.  Hence  P(t)  and  U(t)  are  the  images  in  ®Pn  ^  of  the  curve  y(t)  in 
C  under  the  maps  [v]  :  y->-*Pn  ^  and  [u]  :  y  -*■  <tPn  \  respectively.  These  latter 
maps  are  those  which  determine,  via  the  mini-max  principle  (6.15),  the 


singular  subspaces  in  which  the  left  and  right  singular  vectors  are 
constrained  to  lie.  From  the  discussion  following  Equations  (11.36),  it 
follows  that  the  connection  forms  which  measure  the  failure  of  the  local 


sections  to  be  minimum  energy  vectors  are  now  no  longer  defined  in  the 

original  circle  bundle  over  SPn  \  but  in  its  pullback  along  the  maps 

[v]  :  y  -*■  ®Pn  ^  and  [u]  :  y  ■+  ®pn  respectively. 

One  method  used  to  gain  insight  into  the  behavior  of  scalar 

transfer  functions  was  to  use  an  analogy  with  two-dimensional  potential 

theory  [27].  This  analogy  is  also  used  in  explaining  theory  of  harmonic 

2 

and  analytic  functions  [56,63].  An  interpretation  of  V  logo  is  now  given 
using  such  an  analogy. 

Think  of  the  gradient  vector  Vlogo  =  ^  )  as  being 

the  velocity  vector  field  of  a  fluid  flowing  in  steady  state  along  the  surface 

2 

of  the  complex  plane,  and  identify  $  with  1R  .  Let  C  be  a  simple  closed 
2 

curve  in  E  parameterized  by  y(t)  *  (x(t),y(t)),  a  <  t  <  b.  Assume  that 
Y  is  traversed  counterclockwise  and  that  |^|  =  1.  Then  the  outward 

pointing  unit  normal  to  v(t)  is  given  by  H  = 

Vloga  •  N  *  )  is  positive  at  a  point  of  C,  then 

fluid  is  flowing  out  of  D  (the  interior  of  C)  at  that  point.  The  net  fluid 
flowing  out  of  D  is  given  by  the  integral 


(ta. 

Vdt 


dx 

dt 


)• 


Thus,  if 


net  flow  = 


/(^r  •- if  )•<*.-«  ■ 

rb/  Slog o  dx  Slogo  dv  \  . 

J  \~  Sy  dt  3x  dt  J 


(11.45a) 


(11.45b) 


(11.45c) 
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Using  Stokes'  Theorem, 


net  flow 


2  2 
<•  /  3  logo  3  logo 

J  \  2  +  2 

D  3x  3y 


^  dxAdy 


(11.46) 


Thus  if  log  a  were  harmonic,  the  net  fluid  flow  would  be  zero. 


For  the  largest  singular  value,  (9.25)  shows  that  V  log  a  >  0.  Thus  the 


net  fluid  flow  out  of  D  is  positive.  Similarly,  for  the  smallest  singular 


value  V  log  a  <  0.  Thus  the  net  fluid  flow  out  of  D  is  negative.  For 


intermediate  singular  values  no  general  statement  may  be  made.  However, 

n 

since  det  M(s)  is  an  analytic  function  it  follows  that  log|det  M|  =  IT  logo. 

i=l 

is  harmonic.  Thus  the  sum  of  the  net  flows  (11.46)  over  all  singular  values 


must  be  equal  to  zero.  For  example,  in  the  case  n=2,  it  is  as  though  the 
excess  fluid  entering  the  region  D  from  the  flow  Vlog  a  is  somehow  transferred 
and  leaves  D  in  the  flow  Vlog  a. 

From  (11.43)  and  (11.44)  it  follows  that  equations  analogous  to 


the  Cauchy-Riemann  equations  for  oe'^and  oe^ 6  are  given  by 


H  3°kv  H  ^ku 


dy  J  kv  3y  ku  3y 


H 

-  5y 

’  31ogc 

3x 

3y/ 

’31oga 

+  i£U 

H  ^kv,  H  3pku 
3kv  3x  pku  3x 


(11.47) 


H  3pku 


ku  3y 


•  p  u  3o 

v  +  H  ku 

3x  °ku  3x 


(11.48) 


Note  the  right  hand  side  of  (11.48)  may  be  obtained  from  that  of  (11.47) 


using  the  function  g^: 


r 
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H  9pkv  H 
3kv  Sx  Pv 


>pv  -1  3gk 
3x  gk  3x 


H  9pkv  H  ^v_,  -l^fk 
Pkv  3y  pv  3y  8k  3y 


(11.49) 


where  it  may  be  shown  from  (11.40)  and  (10.41)  that 
-1  8gk  3  H 


-1  °k  3  /  .  v  H  , 

gk  IT  =  pkupkv} 


-1  _^k  3  , .  *  H  , 

3k  3y  =  3y  Pkupkv} 


(11.50) 


Equations  (11.47)  and  (11.48)  may  be  expressed  in  coordinates 
on  <CPn  \  For  the  case  n*2,  k=l: 


,,  33-.  -  oip  0  3  it 

Sloga  1.2  v  .  2x  u 

— r3 - t  53  sm  <t>  — —  -  sm  4>  — — 

dx  3y  v  ay  u  3y 


30  3tl)  „  3\j; 

Sloga  ,  1  .  2 .  vv  .  .  2 ,  m 

— r°—  +  —  »  -  sin  6  — —  +  sm  d  -r — 

3y  3x  v  3x  u  3x 


(11.51) 


-M  -5a  -3'^  -  3>l< 

31oga  30  ,2  v  ,2  u 

— r3 - —  ■  sm  a  — - sin  j  — — 

3x  3y  v  3v  u  3y 


-  ^  [ coscp  cosO  +  sin$  sin4>  e^Pv  ^  ] 

3y  v  v  u  v 


Sloga  36  .2  3^v  .2.  3^u 

"IT  *  "  Sln  MT  +  Sln  *u  lx" 


+  —  McosO  cos$  +  sina>  sin$  e 
3x  u  v  u  v 


J  ^v-Yu)  j  .  (11.52) 


Similar  equations  may  be  derived  for  n  >  2  and  for  other  local  trivializations 


A  differential  one-form  oj(x,y)  =  A(x,y)dx  +  B(x,y)dy  called  exact 

3f  3f 

if  there  exists  a  function  f(x,v)  such  that  df  =  w  =  t —  dx  +  —  dv.  If  mixed 

’•  3X  3y 


second  partial  derivatives  of  f  are  equal,  then  dm  s  d  f  “  (— —  -  ) • 

dxAdy  =  0  and  to  is  said  to  be  closed  (Spivak  [49,  p.  91]).  A  standard 
result  is  that  all  exact  forms  are  closed.  Thus  if  two  one-forms  w.  and 
1^2  differ  by  an  exact  form,  it  follows  that  dio^  ■  du^.  Recall  it  was  shown 
in  Appendix  G  that,  at  points  where  the  singular  values  are  distinct, 
partial  derivatives  of  all  orders  of  the  singular  values  and  vectors  exist. 
Moreover,  away  from  singularities  in  the  coordinate  systems  used  to  describe 
the  singular  vectors,  this  property  is  preserved  so  that,  in  particular, 
mixed  second  partial  derivatives  of  9,0  $v>  0^,  etc.  are  equal. 

Now,  consider  the  forms  (11.43)  and  (11.44).  Since  d^ and  d0  appear 
on  the  left  hand  side  of  these  forms  it  follows  that  (11.43)  and  (11.44) 
differ  from  the  form  A  in  (11.33)  by  an  exact  differential.  Thus  taking  d 
of  the  left-hand  side  of  (11.43)  or  (11. £4)  yields  the  Laplacian  of  log  o 
just  as  in  (11.34).  Using  (11.51)  in  (11.43)  and  taking  d  gives  the  Laplacian 
in  coordinates 


7  logo 


.  30  30 

sin  20 

v  \  3y  3x 
.  30  30 

.  „ ,  ( _ u  u 

sln2*u  \  3x  3y 


30  30  . 

_  —L  N 

3x  3y  / 

30  30  \ 

u  _u  ] 

3y  3x  / 


(11.53) 


2'  2 

Equations  for  7  9^  and  7  9  may  also  be  derived  from  (11.43)  or 

(11.44).  Thus  what  can  be  done  using  the  star  operator  [54]?  For  our  purposes 

2 

it  suffices  to  define  *  for  one-forms  on  IR  .  Let  oo  *  Adx  +  Bdy  with  A  and 


B  real  valued  functions  of  (x,y)  ;  then  =  -Bdx  +  Ady.  Note  that  if 

m  *  df  ■  4^-  dx  +  dy,  then  ojA*^  *  [(4^)^  +  (t^O  ^  ]  dxAdv .  Note  also  that  the 

3x  3y  J  3x  3y 

integrand  of  (11.45a)  is  just  *d(log  a). 
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Applying  the  operator  *  to  (11.43)  and  using  coordinates  yields 


(for  n=2  and  k=i) 


/  2  2  **u\  f  2  H  v 

-  ^  sin  sin  ^  —  J  dx  +  ^sin  -57  * 


(11.54) 


•  2,  9V\h 
sin  MrJdy 


An  equation  for  V  8^  may  be  obtained  from  (11.54)  by  taking  d  of  both  sides: 


>-  /3$v  30v  3^v\ 

9l“  '  sin2<*v\  3y  3y  +  3x  ax  / 

(3$  dip  3  ip  dip  \ 

u  U  ,  U  U  ) 

3y  3y  3x  3x  / 

■ sln  hi? + sin 


AA'i 

1  \  3x2  3v2  / 


(11.55) 


2-  2 

Note  that  in  general  V  6^/  V  0;  the  form  (11.54)  and  the  analogous  form 
obtained  from  (11.44)  do  not  differ  by  an  exact  differential. 


11.3.  Cauchy’s  Integral  Theorem  and  a  Simple  Method  for  Deriving 
Differential  Equations 

In  Section  11.2  and  Chapter  9,  a  number  of  differential  equations 
were  derived  constraining  the  properties  of  a  matrix  transfer  function. 

The  basic  strategy  was  to  obtain  differential  equations  for  the  singular 

3M-  3M 

values  and  vectors  of  a  matrix  M(s)  in  terms  of  —  and  -^p-.  By  then 
observing  that  M(s)  must  satisfy  the  Cauchy-Riemann  equations,  it  was  shown 
that  the  singular  values  and  vectors  are  not  mutually  independent  functions. 


By  assigning  coordinates  to  the  singular  vectors  it  could  be  shown,  for 
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example.  Chat  knowledge  of  the  singular  values  and  singular  subspaces 
determines  the  phase  difference  parameters  9^  and  9^  to  within  an  arbitrary 
constant. 


It  should  be  pointed  out  that  the  generalized  Cauchy-Riemann 
equations  (11.47)  or  (11.48)  are  not  the  only  ones  constraining  the  behavior 
of  the  transfer  function  matrix.  Lemma  9.6  may  be  used  in  conjunction  with 
the  differential  equations  (9.17)-(9.22)  to  show  that  additional  constraints 
on  the  singular  vectors,  and  thus  the  coordinates  4>ui,  ^  and 

must  be  satisfied.  Rather  than  list  examples,  a  systematic  method  for 
deriving  a  complete  set  of  generalized  Cauchy-Riemann  equations  will  now  be 
presented.  The  basic  philosophy  is  that  constraints  on  system  properties 
imposed  by  local  analyticity  of  transfer  functions  are  completely  known. 
However,  this  knowledge  is  expressed  in  standard  coordinates  for  the  matrix, 
while  other  coordinates  are  more  meaningful  in  assessing  feedback  properties. 
Thus  the  goal  is  to  study  how  constraints  imposed  by  local  analyticity 
manifest  themselves  after  coordinate  transformations. 

In  addition  to  the  Cauchy-Riemann  equations,  locally  analytic 
functions  must  satisfy  Cauchy's  Integral  Theorem  [26,48].  Let  f(s)  ■  g(s)  + 
jh(s)  be  locally  analytic  in  s  over  a  simply  connected  region  GC  t,  let  C 
be  a  simple  closed  curve  in  G,  and  let  D  denote  the  interior  of  C.  Then 


.-.1 

T, 


si 


/  f (s)ds  -  0  .  .  (11.56) 

C 


One  way  of  proving  (11.56)  is  to  use  the  Cauchy-Riemann  Equations  (9.1)  and 
Stokes’  Theorem  [49,  pp.  105-106].  Thus 
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/  f(s)ds  =  /  dfAds  ,  (11.57) 

C  D 

where 

dfAds  =  dx  +  — ■  dy^  A(dx  +  j  dy) 

+  dxAdy 

*  o 

by  (9.1).  When  applied  to  the  nonanalytic  function  log  a  +  j  0,  Stokes' 

Theorem  yields 

/  (log  a  +  j  8)ds  *  /  dlogoAds  +  j  dGAds 

C  D  (11.58) 

-§)  +  3  +  IS)]  d*Ady- 

If  9  is  given  by  (11.39),  then  the  generalized  Cauchy- Riemann  equations 

(11.48)  show  that  (11.58)  does  not  generally  equal  zero.  Thus  the  same 

i  0 

motion  of  singular  subspaces  which  prevents  creJ  from  satisfying  the  Cauchy- 
Riemann  equations  also  prevents  Cauchy's  Integral  Theorem  from  being  satisfied. 
Similar  remarks  hold  for  the  function  aeJ  k,  with  9^  defined  as  in  (11.37)  and 
generalized  Cauchy-Riemann  equations  given  by  (11.47). 

Since  each  element  of  a  transfer  function  matrix  M(s)  written  in 
standard  coordinates  is  locally  analytic,  it  follows  that,  under  the  conditions 
on  C  given  prior  to  (11.56), 

/  M  ds  -  0 
C 


(11.59) 


E 


Equivalently, 


dMAds  -  0  ,  Vs  £  d 


(11.60) 


Condition  (11.60)  is  the  key  to  seeing  how  the  property  of  analyticity 
generalizes  to  nonstandard  coordinates.  Write  M  in  terms  of  its  singular 
value  decomposition 


M  «  VZU11 


(11.61) 


Then 


0  =■  dMAds  =*  (dVAds)ZUF  +  VCdZAds)^  +  VZ(dUHAds) 
[dVZU1*  +  VdZUH  +  VZdUH]Ads  -  0 


Z”XdZAds  »  -  [dlAj  +  Z**1^ dVZjAds 


(11.62) 


dZ  SZ  2 

By  writing,  e.g.,  dZ  =  —  dx  +  —  dy,  (11.62)  reduces  to  2n  partial 

differential  equations  which  must  be  satisfied  by  the  singular  values  and 

vectors  of  M(s) .  For  example,  writing 


diag[a. ]  ,  a,  >  a0  >...>  a 
l  l  —  i.  —  —  n 

[vi  i  •••  :  vJ 


(11.63) 


[ul  !  •••  !  un] 


yields  (assuming  n*2  for  simplicity) : 


dlogc^  0 


,  H  .  H 
du^  u^  du^ 


II,  2  H.  1 

v,  dv,  —  v  dv 

11  0^12 


0  dlogo. 


,  H  .  H  1  H,  H, 

du2  ux  du2  u2  —  v2  dv,  v2  dv2 


(11.64) 


Taking  real  and  imaginary  parts  of  the  four  components  of  (11.64)  yields  the 
eight  desired  equations.  If  so  desired,  coordinates  may  be  assigned  to  the 
singular  vectors.  Then  the  diagonal  elements  of  (11.62)  yield  generalized 
Cauchy-Riemann  equations  relating  the  singular  values  to  measures  of  phase 
difference  between  the  left  and  right  singular  vectors. 

Note  also  that  the  same  technique  may  be  used  to  obtain  differential 
equations  which  must  be  satisfied  by  other  decompositions  of  a  transfer  matrix. 
Examples  are  the  polar  decomposition  [14]  and  eigenvalue-eigenvector 
decompositions . 

2 

Note  also  that  2n  equations  analogous  to  the  Laplacian  may  be 
derived  using  the  "d"  and  the  operators.  From  (11.61)  it  follows  that 

dM  -  dVZU®  +  VdZUH  +  VZdU® 

*  dMU  -  V^VZ  +  dZ  +  ZdU^ 

=•  I-^dMU  =  Z-1VHdVZ  +  Z_1dZ  +  dU*^ 

uV  -1dMU  =  l'  "1VH  dVZ  +  Z_1dZ  +  dU^U  (11.65) 

assuming  that  M  A  exists.  Note  that  the  diagonal  elements  of  Z  “dl  are 
given  by 

31ogo.  Sloga 

dlogai  -  1  3x  dx  +  — ^ —  dy  .  (11.66) 

Applying  *  to  (11.66): 

31oga  31oga 

*d(loga  )  - r— 1  dx  +  — — —  dy  .  (11.67) 


Applying  d  to  (11.67): 


(  3  logo.  3  logo.  \ 

d*d(loga. )  -l  - ~  + - )  dxAdy  .  (11.68) 

'  3x  3y  ' 

Since  M(s)  satisfies  Laplace's  equation,  taking  d*  of  (11.65)  yields  a 

matrix  equation  which  must  be  satisfied  by  the  second  partial  derivatives 

of  the  singular  values  and  vectors.  In  particular,  the  real  parts  of  the 

2 

diagonal  elements  of  the  equation  give  formulas  for  7  loga^. 


11.4.  Summary 

The  purpose  of  this  chapter  has  been  to  explore  the  mathematical 
structure  of  the  differential  equations  governing  singular  values  and  vectors. 
It  was  shown  that  geometric  ideas  lend  themselves  readily  to  the  analysis 
of  properties  of  these  equations  whose  study  was  originally  motivated  by  the 
thought  experiments  in  Chapter  9. 

Finally,  it  should  be  pointed  out  that  the  differential  equations 
discussed  in  Chapter  9  and  in  this  chapter  group  themselves  into  two  distinct 
sets.  One  set,  which  will  be  termed  analysis  equations  (e.g.,  (9.12),  (9.13), 
(9.19),  (9.20),  and  (9.25)),  may  be  used  to  obtain  the  derivatives  of  singular 
values  and  vectors  in  terms  of  derivatives  of  the  transfer  function  matrix. 

Thus  they  are  useful  in  determining  the  properties  of  a  specific  matrix. 

The  other  set  of  equations  (e.g.,  (11.43),  (11.44),  (11.51),  (11.52), 
(11.53),  and  (11.55)),  which  will  be  termed  design  equations,  give  relations 
among  the  derivatives  of  the  singular  values  and  vectors  which  must  be 
satisfied  by  any  matrix  transfer  function.  Thus  these  equations,  in 


principle,  play  the  same  role  as  do  the  Bode  integral  relations.  They  constrain 
the  behavior  of  physical  systems  and  thus  limit  the  properties  which  may  be 
achieved  in  feedback  design.  The  difficulty,  of  course,  is  that  it  is  very 
hard  to  get  any  insight  from  the  differential  equations.  Thus  in  the  next 
chapter  a  generalization  of  Bode  gain-phase  relations  to  multivariable  system 
is  pursued. 


-v  -t  -.  "V  -m  •  •  -  ■-  w  - 
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CHAPTER  12 


GENERALIZATION  OF  BODE  GAIN-PHASE  RELATION 
TO  MULTIVARIABLE  SYSTEMS 


12.1.  Introduction 


l* 


In  the  previous  chapters  the  concept  of  a  local  trivialization 
has  been  used  to  express  the  left  and  right  singular  vectors  in  coordinates. 
The  resulting  sets  of  parameters  each  included  a  measure  of  phase  difference 


between  the  left  and  right  singular  vectors.  In  Chapter  8,  an  example  was 


J9 


presented  which  suggested  that  the  function  oeJ  could  be  related  to  feedback 
properties  in  much  the  same  way  as  could  a  scalar  open  loop  transfer  function. 
The  relation  between  a  and  0,  however,  was  clearly  not  that  between  the  gain 
and  phase  of  a  stable  minimum-phase  rational  transfer  function.  This  motivated 
the  study  of  just  how  these  two  functions  were  related.  Towards  this  end, 
differential  equations  analogous  to  the  Cauchy- Riemann  equations  were  derived. 
These  equations  showed,  in  principle,  that  9  and  a  were  related  as  a  function  of 
the  singular  subspaces.  It  was  not  clear,  however,  that  such  generalized 
Cauchy-Riemann  equations  provide  much  insight  into  the  nature  of  this  relation. 
Motivated  by  the  Bode  gain-phase  relations  for  scalar  feedback  systems,  which 
provide  more  insight  than  the  scalar  Cauchy-Riemann  equations,  one  is 
naturally  led  to  ask  whether  analogous  integral  relations  may  be  obtained  for 
relating  0  and  log  o.  The  purpose  of  this  chapter,  is  to  provide  such  a 
generalization  and,  in  particular,  to  study  the  effect  of  motion  of  the 
singular  subspaces. 


■1 


id 


.■'«n 
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12.2.  Integral  Relations  Among  Singular  Values,  Phase  Differences, 
and  Singular  Subspaces 


In  Chapter  11  the  one-form 


3  log  a. 


3  log  o, 


_  ,  H  3M  .  ,  .  j_  T  r  H  311  ,  . 

Im[vi  «  "ii*1  +  z  Imtvi  I?  uiidy 


.  H  3vi  H  3ui,  .  H  3Ti  H  3ul, . 

’  tvi  ~ ui  +  tvi  ~ui  17!dy 


(12.1) 


was  studied.  It  was  shown  that  (12.1)  relates  the  singular  values  to 

various  measures  of  phase  difference  between  the  singular  vectors.  Assume 

that  both  the  left  and  right  singular  vectors  may  be  assigned  phase  coordinates 

using  the  local  trivialization  T  ,  as  in  (10.5).  (For  the  remainder  of  this 

z 

chapter,  the  subscript  "i"  used  in  (12.1)  to  distinguish  among  singular  values 

will  be  suppressed  unless  a  specific  singular  value  or  singular  vector  pair 

is  being  discussed.)  Then,  using  T  ,  the  left  and  right  singular  vectors 

z 

may  be  written  as  in  (10.8) 


v(s)  »  oz([v(s)])e^9z^V^S^ 
u(s)  -  p  ([u(s)])eJ9z(u(s))  , 


(12.2) 


where  in  this  local  trivialization  the  phase  difference  is  given  by 


A8  (s)  *  9  ( v ( s ) )  -  0  (u(s)) 
z  z  z 


(12.3) 


Two  examples  of  local  trivializations  were  singled  out  in  Chapters 

10  and  11.  The  first  used  one  of  the  standard  basis  vectors  to  define  the 

H 

phase  of  a  vector  w  by  9^(w)  m  as  (10.10).  The  second  used  a  right 

H 

singular  vector  u  to  define  phase  as  6y(w)  ■  ^u  w  as  in  (10.34). 


Jl .  .*».  A*.  A. a .  . .  —A  a.,. 


? 
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In  the  remainder  of  this  chapter,  the  subscript  "z"  will  be 
suppressed  in  (12 .2) -(12 .3)  unless  a  specific  local  trivialization  is  being 
discussed.  Given  any  local  trivialization,  writing  the  singular  vectors  in 
terms  of  the  local  sections  and  phase  difference  coordinates  as  in  (12.2) 
and  substituting  into  (12.1)  yields: 


3A0  3A8  . 

+  -57 dy 


3  log  a  j  ,  3  Iok  o  .  ,  ,r  H  3°v  H  3°u, 

— 5T“  &  +  — 5T“  dy  +  3[  PV  IT-  PU  “ld!< 

.  , ,  H  5pv  H  3°u, 

+  31  p,  17'  pu  I7,dy 


(12.4) 


where  Pv“p[v(s)],  p  *p[u(s)],  and  A8*A8(s). 

In  principle  (12.4)  may  be  used  to  calculate  A0 (jcu)  as  a  function 
of  the  singular  values  and  singular  subspaces;  i.e.,  assuming  that  the  images 
of  the  left  and  right  singular  subspaces  lie  within  the  domain  of  definition 
of  the  section  p ,  then 


A0(jui) 


(12.5a) 


40(ju) 


p“^»dy-d<3»- 


(12.5b) 


Hence 


A6  (jto)  -  |[A8(-j«)  +A6(j-)]  =  yU 


dy  ♦  f  dy) 

(4)  (0 


.  -<*•  „  3p  „  3o  *  3p  „  3o 

if  (\  17"  5u  ly',dy  +  >  (cv  IT'  p„  T^>dyl-  (12-6> 


If  oe  were  a  stable  minimum  phase  proper  rational  transfer 
function,  or  if  the  singular  subspaces  were  constant,  then  the  second  term 


on  Che  right  hand  side  of  (12.6)  would  not  be  present.  Thus  (12.6)  would 

show  that  the  value  of  A6  is  completely  determined  by  a,  and  Cauchy's 

1 A9 

integral  theorem  could  be  applied  to  show  the  function  aeJ  must  satisfy 
a  Bode  gain-phase  relation  as  in  Theorem  2.2. 

In  general,  however,  oe^®  is  not  analytic  and  Cauchy's  Integral 
Theorem  cannot  be  applied.  However,  an  integral  equation  analogous  to  the 
Bode  gain-phase  relation  may  still  be  derived  using  Stokes'  Theorem. 

First,  the  behavior  at  infinity  of  the  singular  value  decomposition 
will  be  discussed.  Generally,  the  singular  values  will  not  be  distinct  at 
s  =  <*>;  in  fact,  in  many  cases  of  interest  they  will  all  be  equal  to  zero.  The 
followsin  Lemma  states  conditions  under  which  it  remains  possible  to  uniquely 
define  one-dimensional  singular  subspaces  at  s=°°. 

Lemma  12.1:  Let  the  matrix  M(s)  taking  values  in  an*n  have  entries  which 
are  proper  rational  transfer  functions.  Assume  that  detM(s)  ^0  and  that 
the  singular  values  of  M(s)  are  distinct  in  the  closed  right  half  plane 
(CRHP) .  (Note  this  implies  that  the  singular  values  may  be  ordered  so 
that,Vs£CRHP,  a^[M(s)  ]  >  02[M(s)  ]  >  •••  >an[M(s)].).  Then  there  exist 
constants  k^  >  0  and  >  0  such  that  for  any  a  E  [  — tt/ 2 ,  tt/ 2 ] 

k  . 

lim  R  xc  [M(ReJ  )]  *  c  .  (12.7) 

R-o°  1  1 

Next  assume,  Vi  and  j,  that  if  k± *  kj ,  then  c± t  .  Then  Vi  there  exists  a 

pair  of  unit  vectors  w^  and  z^,  lying  in  uniquely  defined  one-dimensional 

subspaces  and  2^,  such  that  for  any  a€  [-u/2,tt/2] 

k 

c  -  lim  (Reja)  VWejct)z  .  (12.8) 

R-*-“  1 


Proof  :  See  Appendix  I. 


Suppose  that  the  vectors  w^  and  z^  in  (12.8)  exist,  and  that  these 
vectors  both  lie  in  the  domain  of  definition  of  some  local  trivialization 
T.  Then,  by  continuity,  there  exists  a  neighborhood  of  infinity  such  that 
the  left  and  right  singular  vectors  v^  and  u^  also  each  lie  within  the 
domain  of  definition  of  T.  LetAS^.  be  the  phase  difference  (12.3)  between 
v^  and  ui  as  measured  in  this  trivialization.  Then  as  s  +  ®  along  the  ray 
Re^a,  a€  [-tt/2,tt/2] 


)±  -*•  0(w£)  -  9(zi)  -  k^a. 


This  follows  from  (12.8)  since,  as  R  the  right  singular  vector  may  be 

chosen  so  that  u^(Re^a)  -*■  z^ ;  the  left  singular  vector  must  then  satisfy 
vi(Rej°‘)  -  (-~-)ki  wt  -  (^-)Rki  Mtte^z^ 

Theorem  12.2:  Let  the  matrix  M(s) ,  caking  values  in  (En*n,  have  entries 
which  are  proper  rational  transfer  functions.  Assume  at  each  point  in  the 
closed  right  half  plane  (CRHP)  that  M(s)  satisfies 

(i)  M(s)  is  analytic 

(ii)  det  M(s)  f  0 

(iii)  M(s)  has  n  distinct  singular  values. 

Note  that  at  each  point  s£CRHP  there  exists  a  local  trivialization  whose 
domain  of  definition  includes  both  members  of  the  ith  pair  of  singular 
vectors  u^  and  v^. 

(iv)  Assume  that  there  exists  a  local  trivialization  T  such  that, 
VseCRHP,  u^(s)  and  v^(s)  both  lie  within  the  domain  of 
definition  of  T. 
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Finally,  assume 

(v)  in  Lemma  12.1,  if  k^  =  k^  ,  i f  j ,  then 
(vi)  the  domain  of  definition  of  the  local  trivialization  T  given 
in  (iv)  also  includes  the  vectors  w^  and  z^  given  by  Lemma 
12.1. 


Let  T  be  a  local  trivialization  satisfying  (iv)  and  (vi) .  Note  that  at 

each  point  sGCRHP  the  left  and  right  singular  vectors  and  the  phase  difference 

corresponding  to  the  singular  value  a  may  be  written  as 


j0 

J  V  A 

j  6(v) 

p  e  = 

V 

p([v])e 

jeu  A 
p  e  = 

u 

jS(u) 

p([u])e 

A0  -  0(v)  -  e(u) 


(12.9a) 

(12.9b) 

(12.9c) 


Assume  the  local  trivialization  has  the  property  that  o([w])  =  p([w]). 

Then  cexp[jA9]  is  conjugate  symmetric,  A9(0)  =  kir ,  k  an  integer,  and  for 

each  u  >  0 
o 


A9  (j<JJQ) 


-  A0 (0) 


{log  coth 


H 

P 


u 


3d 

t — ~) dr Ada 
3r 


(12.10) 


v  =  log(f-) 
o 


(12.11) 


Proof :  See  Appendix  I. 


Theorem  12.2  shows  that  the  measure  of  phase  difference  A6(jw)  may 
be  computed  from  knowledge  of  the  associated  singular  value  and  singular 

* 

These  assumptions  may  be  relaxed. 
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subspaces.  Thus  In  one  respect,  A0 (jui)  is  similar  to  the  phase  of  a  scalar  ^ 

transfer  function.  Once  enough  of  the  other  properties  of  a  system  are 
specified,  there  is  no  freedom  remaining  to  specify  A6(ju).  There  does 
exist  an  important  difference,  however.  The  phase  difference  A9(jw)  is  a  v* 

function,  not  only  of  the  gain  o  along  the  joi-axis,  but  also  of  the  way  in 
which  the  input  and  output  directions  vary  over  the  right  half  plane. 

An  Interpretation  of  the  Integrand  of  the  second  Integral  in  (12.10) 
will  now  be  developed.  Consider  the  differential  form  A.^  in  (12.1)  evaluated 
along  a  curve  y(t)€C.  Then,  suppressing  the  subscript  "i"  used  to  distinguish 
among  singular  values,  this  yields 


it 


-31ogq  dx 
3y  dt 


+ 


aiofio  d£i 

3x  dtJ 


ir  r  HdM  , 
a  Imfv  ^u] 


H  dv  H  du 
dt  dt 


(12.12) 


The  discussion  will  proceed  by  showing  how  the  right  hand  side  of  (12.12)  is 

related  to  the  integrand  of  the  second  integral  in  (12.10).  First,  given  an 

H  dw 

arbitrary  unit  vector  w,  the  meaning  of  the  term  w  (the  component  of  the 
derivative  of  the  vector  lying  along  the  vector  itself)  will  be  explored. 

Let  the  unit  vector  in  question  be  written  as 

j9(t) 

w(t)  *  p(t)e  (12.13a) 

where 

p(t)  -  p([w(t)D  (12.13b) 


is  the  local  section  corresponding  to  the  local  trlvialization  used  to  assign 
the  measure  of  phase 


9 (t)  -  e(v(t)> 


(12.13c) 


•It1!*:  f*  cr: 


w  dt  ■ p  t* i  di 


(12.14) 


To  explore  the  significance  of  (12.14),  first  recall  that  a  unit  vector 

defined  along  a  curve  through  ®Pn  ^  is  said  to  be  "minimum  energy"  if 
H  dw 

w  —  =  0.  This  condition,  in  turn,  implies  that  the  vector  changes  along 
at 

the  curve  by  only  the  amount  needed  to  remain  in  the  appropriate  subspace  of 


Consider  first  the  case  in  which  the  subspace  spanned  by  w(t)  is 

constant.  Since  the  local  section  p  is  a  function  of  (and  only  of)  this 

subspace,  it  follows  that  «  0.  Thus,  in  this  case,  the  "excess  energy" 

at 

H  dw 

measured  by  a  nonzero  value  of  w  —  translates  completely  into  a  change  in  the 

at 

phase  9.  In  this  way,  the  vector  w(t)  is  analogous  to  a  scalar  allpass 
function  (i.e.,  a  function  whose  magnitude  is  constant  with  t) .  (Note  also 
the  condition  that  the  subspace  spanned  by  w(t)  is  constant  implies  that  w(t) 
moves  along  the  great  circle  of  S^n  1  identified  with  the  circle  of  unit 
vectors  of  Cn  lying  in  the  subspace  spanned  by  w.) 

Now  recall  the  thought  experiments  discussed  in  Chapter  9.  These 


suggested  that  there  exists  a  component  of  the  motion  of  a  unit  vector 


H  dw 


analogous  to  "spinning,"  and  that  the  measure  of  spinning  is  given  by  w 
Hence,  minimum  energy  vectors  are  those  which  do  not  "spin"  as  they  move. 

It  was  further  conjectured  that  the  spinning  of  a  unit  vector  might  be  related 
to  some  measure  of  the  changing  phase  of  the  vector.  Moreover,  when  a  pair 
of  left  and  right  singular  vectors  was  considered,  the  difference  ^7-  uH  ^7, 
interpreted  as  the  difference  in  the  rates  of  spinning  of  the  vectors ,  was 
conjectured  to  be  related  to  some  measure  of  changing  phase  difference  between 


Che  vectors.  In  Chapter  9,  these  conjectures  could  not  be  investigated,  since 
no  rigorous  notions  of  phase  of  a  vector,  or  of  phase  difference  between 
vectors,  were  yet  available.  Using  the  ideas  about  phase  developed  in 
Chapters  10  and  11,  these  conjectures  will  now  be  explored.  The  essential 
equation  is  (12 . 14) . 


Now,  as  has  just  been  pointed  out,  in  the  special  case  where  the 

H  dw 


subspace  spanned  by  w(t)  is  constant,  the  value  of  w  —  is  indeed  equal  to 
the  rate  of  change  of  9,  the  measure  of  phase.  Moreover,  this  conclusion  is 
independent  of  the  trivlallzation  used  to  measure  6.  Recall  next  another 
special  case  introduced  in  Chapter  9,  i.e.,  that  in  which  the  vector  w  is 
minimum  energy  (but  the  subspace  it  spans  is  not  necessarily  constant) .  From 


(12.14)  it  follows  immediately  that  the  measure  of  phase  9  may  change  even  if 


H  dw 

w  —  *  0.  A  necessary  and  sufficient  condition  for  this  to  occur  is  that  the 


H  do 

section  p  not  be  minimum  energy;  i.e.,  that  p  ^  +  0.  Note  another  necessary 


H  dw  d  9 

condition  for  w  —  i1  j  —  is  that  the  subspace  [w(t) ]  not  be  constant.  This 
at  at 


H  do 

is  not  a  sufficient  condition,  however,  since  the  component  p  —  must  also 


be  nonzero. 


H  dw 

Finally,  consider  the  general  case,  in  which  w  —  ^  0  and  the 


subspace  spanned  by  w(t)  also  is  changing  (implying  that  ^7  £  0) .  It  follows 


H  dw 

that  the  value  of  w  ^-measures,  not  the  rate  of  change  of  the  phase  6, 


but  rather  the  rate  of  change  of  the  difference  between  6  and  the  phase  9  of 


a  minimum  energy  vector  w*(t)  traversing  the  same  path  through  tPn  This  may 


be  seen  easily  from  (12.14),  from  which  it  follows  that  the  race  of  change  of 

* 

the  phase  of  w*(t)  is  given  by  j  -  -  p^  — 


dt’ 


,  ,  <16  H  dw  ,  .  d9 

therefore,  j  -  w  j- +  1  ~t~. 


In  the  general  case. 


Note  that  the  value  of  w  is  independent  of  the  local  trivial- 

ization  used  to  define  phase.  The  rate  of  change  of  the  difference 
* 

8-8  ,  however,  is  not  so  independent.  This  is  because  the  rate  of  change 

of  the  phase  6*  of  a  minimum  energy  vector  is  a  function  of  the  local 

trlvialization  used,  as  is  the  value  of  the  component  p®  4^-  of  the  derivative 

dt 

of  the  corresponding  local  section.  Thus,  from  (12.14),  the  value  of  the  rate 

•  jj  ^ 

of  change  of  6  is  equal  to  the  difference  between  the  component  w  —  and  the 

at 

u 

value  of  the  connection  one-form  p  dp  evaluated  on  the  tangent  vector  to  the 
curve  through  <£Pn  ^  traversed  by  [w(t)J.  This  latter  term,  of  course, 
quantifies  the  amount  by  which  the  section  p  itself  deviates  from  being  a 
"minimum  energy"  vector.  Finally,  observe  that  once  a  local  trlvialization 

with  which  to  measure  the  phase  of  w  has  been  chosen,  then  the  discrepancy 

H  dw  d0  «  i 

between  w  —  and  ^  is  a  function  solely  of  the  path  through  (EP11-1  along 

which  w(t)  is  constrained  to  travel  (equivalently,  the  way  in  which  the  sub¬ 
space  [w(t)j  changes.)  This  follows  since  choosing  a  local  trlvialization 
implies  a  choice  of  rule  for  defining  the  local  section  p,  which  in  turn  is 
a  function  solely  of  the  subspace  in  which  the  vector  w  lies.  In  some  sense, 

it  may  be  said  that  changes  in  the  phase  6  arise  from  two  sources.  One 

H  dw 

source  is  the  value  of  the  component  w  — ,  while  the  other  source  is  the 

at 

motion  of  the  subspace  [w(t) ] .  This  latter  source  will  now  be  studied 
further. 

It  is  instructive  to  write  p  in  coordinates  so  that  insight  may  be 
gained  into  how  motion  of  the  subspaces  produces  a  change  in  phase.  Assume 

U 

that  the  local  trlvialization  used  to  compute  9  is  given  by  8  ■  8  (w)  »  *z  w, 

21  1 

where  z^  is  a  constant  unit  vector.  Choose  constant  vectors  i-2,...,n, 

so  that  {z.  ;  i«l,...,n}  is  an  orthonormal  basis  for  <En.  Then 


(12.14) 


P  -  P_  ([w])  -  E  z  (z.p) 
Z1  1-1  1  1 


H 


where  z^p  is  a  positive  real  number.  Thus 


_  J  z  (ZH  ie.) 

dt  .  .  iUi  dt' 
1-1 


(12.15) 


H  dp  “  ,  H  w  Hdp, 

c  a?  ■  ; E,  (o  zi)(2i  dt> 
1-1 


(12.16) 


Write 


^“i  4  H 

rie  “  V 


(12.17) 


2  _ 


where  I  r  = 1  since  {z, }  Is  an  orthonormal  basis  and  p  is  a  unit  vector. 

i-1  H 

Note,  also  that  a^  =  0,  since  -  *Zjp,  and  this  component  of  p  is  positive 


real  since  the  vector  z1  was  used  to  define  the  local  trivialization  for  which 

n  drA 

p  is  the  local  section.  These  facts  imply,  in  turn,  that  Z  r.  -r—  - 

1  d  n  2  dal  i“1 

—  —  Z  r^  -  0,  and  that  -  0.  Thus,  since  the  z^'s  are  constant. 


2  dt  “  ‘i 
i-l 


H  do  d  ,  H  , 
*i  <zi0) 


da. 


dri  J°i  „  -i  ,  * 

IT e  +  rie  J  7T 


H  do 


Jal 


“'w<ri*  Jri' 


J“l  dai, 


dt 


n  dr  ,  da 

‘ri  dT  +  *  ' i  dT1 


H  do  .  ,  °  2  % 

0  dt  '  J  ±l2  rl  dT 


(12.18) 


Let  the  subspace  spanned  by  w  be  denoted  [w] .  Equation  (12.18)  reveals  a 


H 


condition  necessary  for  motion  of  [w]  to  produce  a  change  in  *z  w;  i.e.,  in  the 


phase  of  that  component  of  w  used  to  define  6  via  the  local  trivialization 
under  discussion.  First,  note  that  (12.14)  implies  w  may  be  written  as 


pe' 


je 


“  -H  j9. 
I  z.  (z.peJ  ) 

i-1 


H  “ 

z^w 

H 

rl 

ja 

z2w 

• 

• 

-  z 

r2  6 
• 

• 

• 

H 

N 

r  e 
n 

je 


C12.19) 


where  Z 


[  Z-.  !  • 

1  I 


H  H  H 

.  {  z  ] ,  8  ■  *z"w,  and  a.  ■  *z .w - *z,w.  Observe  that 
in  1  i  i  1 


H  H 

each  equals  the  difference  in  phase  between  the  components  z^w  and  z^w. 

jQl 

(Hence,  is  identically  equal  to  zero  and  e  = 1.)  Next,  note  that 
(12.13)  and  (12.18)  taken  together  imply 

H  dw  d6  “  2  dai 

w  «  *  1  dF  +  J  ri  dT 

,  j  °  r2di+J  J  r2f!i 

J  l£l  1  dt  J  i-2  1  dt 


“  2.d0  d“i. 


(12.20) 


where  the  last  equality  holds  since  is  constant.  Equation  (12.20)  reveals 

a  condition  necessary  for  the  motion  of  the  subspace  [w]  to  cause  the  value 
d  0  H  dw 

of  -jr  to  deviate  from  that  of  w  — .  The  condition  is  that  the  rate  of  change 
at  at 

U 

of  the  difference  in  phase  between  z.w  and  at  least  one  of  the  other  components 

H  dai 

z.w,  1*2, ...,n,  must  be  nonzero.  That  is,  it  is  necessary  that  -rr-  f  0  for  at 


least  one  value  of  i,  2  <  i  <  n.  Moreover,  note  that  this  condition  is 

necessary  no  natter  what  choice  of  vectors  {z^  ;  i«2,...,n)  is  used  to 

complete  the  orthonormal  basis  in  (12.14).  (Recall  the  vector  is 

determined  by  the  local  trivialization  and  hence  is  fixed.)  Thus,  if  there 

d°i 

exists  any  choice  of  {z^  ;  i"2,...,n)  for  which  ■  0  V  i«2,...,n,  it 

follows  that  pH  4“  ■  0  (i.e.,  p(t)  -  p  ([w(t)])  is  a  minimum  energy  vector) 
at  z^ 

and  the  motion  of  [w]  does  not  affect  6 .  (The  converse  to  this  statement  is 

not  true,  however.  It  may  well  be  that,  for  some  choice  of  basis,  pH  *  0 

due  to  cancellations  among  nonzero  terms  on  the  right  hand  side  of  (12.18).) 

Note,  in  particular,  that  if  the  motion  of  [w]  produces  changes  only  in  the 

relative  magnitudes  of  the  various  components  of  w,  then  no  effect  upon  9 

is  produced.  To  summarize,  a  necessary  condition  for  motion  of  the  subspace 

dai 

[w]  to  affect  the  value  of  0  is  that  3 i»  2  <  i  <  n,  for  which  ^  0.  A 

“  2  dcti 

sufficient  condition  is  that  the  weighted  sum  I  r.  -rr-  t  0. 

i=>2  1  aC 

The  preceding  remarks  may  also  be  viewed  in  terms  of  the  decompose 

tion  of  the  tangent  space  into  horizontal  and  vertical  subspaces,  as 

discussed  in  Chapter  11.  Consider  XS  TwS^n  ^  and  decompose  X  into  its 

V  H 

horizontal  and  vertical  components  as  in  (11.4):  X  ■  X  +  X  .  Evaluating 

H  V 

the  form  w  dw  on  the  vertical  component  X  yields 

H  V  H  V  V 

w  dw(X  )  -  p  dp(ir*X  )  +  j  d9(X  ) 

-  j  d0(XV)  (12.21) 

V 

because  it^X  ■  0  by  the  characterization  (11.3)'  of  the  vertical  subspace. 

Furthermore,  note  that  (11.3)'  implies  77-  ■  0  along  any  curve  whose  tangent 

dt  _  , 

n  jai 

vector  lies  in  the  vertical  subspace.  Thus,  since  p  ■  I  z. (r.  e  ) ,  it 


follows  that  -7—  ■  0  along  such  curves  and  that  w  —  -  1  Z  r,  —  -  1  — 
at  at  .  ,  i  at  at 

i*l 

by  (12.20)  and  the  fact  that  w  is  a  unit  vector.  Consequently,  the  vertical 
component  of  a  tangent  vector  can  produce  only  equal  changes  in  the  phase  of 
each  component  of  w. 

The  horizontal  component  of  a  tangent  vector,  on  the  other  hand, 
can  never  produce  equal  nonzero  changes  in  the  phase  of  each  of  the  components 
of  w.  This  fact  follows  from  the  characterization  (11.2)  of  the  horizontal 
subspace  and  Equation  (12.20).  Together,  these  expressions  yield 


wHdw(XH)  -  j  de(XH)  +  oHdp(irikXH) 

-  j  d0(XH)  +  j  Z  r2.  da.0r.XH) 

i-2  1  1 

-  0  .  (12.22) 


Equation  (12.22)  shows  that  along  a  curve  whose  tangent  vector  lies  in  the 

dQ  <*(*1 

horizontal  subspace,  —  »  0  unless  -7—  #  0  for  at  least  one  a.,  i»2,...,n. 

at  at  1 

From  the  definition  of  a^,  it  follows  that  along  such  a  curve  the  phases  of 
each  component  cannot  change  by  an  equal  nonzero  amount.  Hence,  in  order 
that  X»  produce  a  nonzero  value  of  it  is  necessary  that  the  subspace 
spanned  by  w(t)  change.  Moreover,  from  (12.20),  it  follows  that  the  rates 
of  change  in  phase  of  the  components  are  constrained  by  the  weighted  sum 

da. 


1  +  _ *) 


(12.23) 


H  H 

where  9  *  S-z^w  and  9  +  a^  *  ^z^w,  i*2,...,n. 

There  are  thus  two  mechanisms  for  producing  changes  in  9 , 


corresponding  to  the  decomposition  of  the  tangent  space  into  vertical  and 


horizontal  subspaces.  The  change  due  to  the  vertical  component  X  is 
analogous  to  the  change  one  would  expect  were  w(t)  a  scalar-valued  curve. 
This  is  reasonable  since  curves  whose  tangent  vectors  lie  in  the  vertical 


subspace  span  a  constant  one-dimensional  subspace  of  <Cn.  The  change  due  to 
H 

X  ,  on  the  other  hand,  is  due  solely  to  the  fact  that  w(t)  is  a  vector¬ 
valued  curve.  This  too  is  reasonable  since  changes  in  phase  can  be  produced 
in  this  fashion  only  if  the  subspace  spanned  by  w(t)  is  not  constant.  Since 
the  net  change  in  8  is  due  to  the  sum  of  these  effects,  it  follows  that  the 
discrepancy  between  the  behavior  of  6  and  that  of  the  phase  of  a  scalar 
allpass  function  is  quantified  by  the  term 

d0(XH)  -  -  Z  x2  da . (t*XH)  .  (12.24) 

i-2  1 


An  interpretation  of  (12.24)  is  that  the  horizontal  component  can  cause  phase 
to  be  transferred  from  one  component  of  w  to  another,  although  the  net 
(weighted)  sum  of  the  rates  of  phase  change  must  equal  zero;  i.e., 

n  „  da. 


Z  r^(—  +  - i.) 

^  Vdt  dt ' 


(12.25) 


This  interpretation  will  be  applied  shortly  to  discuss  how  phase  lead  and 
lag  may  be  transferred  among  the  loops  of  a  linear  multivariable  system. 

Consider  again  (12.12),  the  differential  form  in  (12.1)  evaluated 
along  a  curve  y(t)e<E.  The  preceding  discussion  may  be  extended  to  show  how 
this  form  relates  a  singular  value  to  various  measures  of  phase  difference 
between  its  associated  pair  of  singular  vectors.  Let  the  left  and  right 
singular  vectors  be  written  in  a  local  trivialization  as  in  (12.2)-(12.3) . 
Using  the  abbreviated  notation  defined  immediately  following  (12.4)  yields 


V 


(12.26a) 


Equation  (12.27)  Is  the  key  to  understanding  the  relation  among  a  singular 

value,  the  subspaces  spanned  by  the  associated  pair  of  singular  vectors,  and 

various  measures  of  phase  difference  between  these  vectors.  Thus  a  number  of 

questions  originally  posed  in  Chapter  9  may  now  be  resolved.  Note,  in  particular 
1  H  dM 

that  the  term  Im[v  —  u]  is  not  generally  equal  to  the  rate  of  change  of 
cr  at 

the  phase  difference  A9.  Another  way  of  viewing  this  is  to  consider  an 
arbitrary  pair  of  minimum  energy  vectors  lying  above  the  paths  through  CPn  ^ 


traversed  by  the  left  and  right  singular  subspaces .  Let  these  vectors  be 
*  * 

denoted  by  v  and  u  ,  respectively,  where 


(12.28a) 

(12.28b) 


and 


(12.28c) 


is  the  phase  difference  between  the  pair  of  minimum  energy  vectors.  From 
the  discussion  of  minimum  energy  vectors  a  few  paragraphs  back,  it  follows 


that  the  phases  of  v  and  u  are  related  to  the  sections  p  and  p  via  the 

v  u 


rules 


d6  „  dp 

.  _ v  H  v 

^  dt  pv  dt 


d9  „  dp 

.  u  H  u 

^  dt  Pu  dt 


(12.29a) 


(12.29b) 


Hence,  the  difference  in  the  phases  of  the  minimum  energy  vectors  is  related 


to  the  sections  by 


,  H  H  dpu, 

(pv  dt  '  pu  dt  ) 


(12.30) 


Using  (12.30)  in  (12.27)  yields 


dA0  1  r  H  dM  ,  .  dA0 

_  -  _  Im[v  —  u]  +  — 


,-31oga  dx  91ogq  dy-,  dA8 
9y  dt  dx  dt  dt 


(12.31) 


Equation  (12.31)  shows  that  the  term  —  Im[v^  4^  u]  determines,  not  44^-,  the 

o  dt  dt 

rate  of  change  of  phase  difference  between  the  associated  pair  of  singular 

* 

vectors,  but  rather  the  discrepancy  between  ~r—  and  — : — ,  the  rate  of  change 

at  at 

of  phase  difference  between  a  pair  of  minimum  energy  vectors  traversing  the 
paths  U(t)  and  il(t)  throuch  GPn  ^  traversed  by  the  left  and  right  singular 
subspaces.  In  the  scalar  case  the  latter  term  is,  of  course,  not  present, 
and  the  usual  relation  between  gain  and  phase  difference  is  obtained.  In  the 
general  multiple  loop  case,  the  relation  between  the  measure  of  gain  given  by 
a  singular  value  and  the  measure  of  phase  difference  given  by  A6  is  influenced 
by  the  difference  in  the  two  paths  0(t)  and  U(t) .  Recall  the  singular  subspaces 
are  constrained  to  traverse  these  subspaces  as  a  consequence  of  the  min-max 
property  (6.15). 


■  •  ■  .  ■  .  *  .  ■  «>  ,  „•  /  „•  .•  .  ,  .  •  .  •  . 
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It  is  instructive  to  write  the  local  sections  p  and  p  in  coor- 

v  u 

dinates  so  that  insight  may  be  gained  into  how  motion  of  the  singular  subspaces 

affects  the  phase  difference  A0.  As  in  the  discussion  of  Equations  (12.14)- 

(12.20),  let  the  local  trivialization  used  to  compute  phase  difference  be 

H 

given  by  0_  (w)  =  34z,w,  where  z,  is  a  constant  unit  vector.  Hence,  from 
Z1  1  1 

(12.26),  A0  =  0  -  0  ,  where  0  =0  (v)  =  ^zfv  and  0  =0  (u)  =  *z!?u.  Again, 

v  u  v  z^  1  u  z^  1  ° 

choose  constant  vectors  z^,  i=2,...,n,  so  that  (z^  ;  i=l,...,n}  is  an 
orthonormal  basis  for  (1°.  Then 


p  =  p  ( [v] )  =  Z  z . (z .  p  ) 
v  z.  1  i  i  v' 


(12.32a) 


p  =  p  ( [u] )  -  Z  z. (z.p  ) 
u  z,  1  .  ,  i  l  u 

1  1=1 

H  H 

where  z^Pv  and  z^pu  are  positive  real  numbers.  Thus,  writing 


(12.32b) 


Jaiv  A  H 
riv  6  =  ZiPv 


(12.33a) 


JCliu  A  H 
r .  e  =  z.p 
iu  i  u 


(12.33b) 


yields,  as  in  (12.18) 


.  ;  2  daiv 

Dv  dt  ^  -  riv  dt 

1=2 


(12.34a) 


h\  .  ;  2  d°iu 

°u  dt  d  ^^2  r^u  dt 


(12.34b) 


Consequently,  as  in  (12.20), 


H  dv  a  du  .  dA9  H  dSv  H  d"u 

v  ir'u  d?':i7r  +  Dvir-cu'dr 


J\ 


J  df  +  j  ±i2  triv  It  ‘  riu  ~dF“J 

r  a  logo  dx  3  logo  d^i 

-11"  3y  dt  3x  dtJ 


(12.35) 


Equation  (12.35)  reveals  a  necessary,  but  not  sufficient,  condition  for  motion 

d  A0 

of  the  singular  subspaces  to  cause  the  value  of  — to  deviate  from  that  of 

-  d-*-°8a  +  31ogo  ^  (in  other  words ,  to  cause  the  relation  between  o  and  A6 

3y  dt  3x  dt 

to  deviate  from  that  between  scalar  gain  and  phase) .  This  necessary  condition 

.  da.  0  da 

.  ,  t  2  iv  2  iu.  ,  _ 

is  that  one  or  more  of  the  terms  [r.  —rr - r.  — r-  j  be  nonzero.  For 

iv  dt  iu  dt 

example,  if  motion  of  the  singular  subspaces  produces  only  changes  in  the 

relative  magnitudes  of  the  components  of  each  singular  vector,  so  that 
da^v  ^aiu 

— - —  ■  — ■  ■  u-  =  0,  i“2,...,n,  then  the  relation  between  a  and  A0  is  not  affected, 
dt  dt 

Furthermore,  suppose  the  magnitudes  of  the  ith  components  of  the  singular 

vectors  are  the  same  (so  that  r  *  r.  ,  i*2,...,n)  and  that  the  derivatives 

daiv  daiu 

of  the  coordinates  a  ^  and  a^  are  the  same  (-  ■  "dt”"’  i=2,...,n).  In  this 

case  also  it  follows  that  motion  of  the  singular  subspaces  does  not  affect 
the  relation  between  a  and  A9.  Finally,  note  in  particular  that  if  pv  =  Pu» 
so  that  the  singular  vectors  are  eigenvectors,  then  riv  =  r^  ,  i=2,...,n  and 

»  aiu,  i«2,...,n.  It  follows  that,  locally,  the  discrepancy  between  a  and  A0 
and  scalar  gain  and  phase  can  be  significant  only  if  the  singular  vectors  do 
not  well  approximate  eigenvectors  of  M(s) .  This  observation  should  have  some 
significance  for  design  methods  based  upon  eigenvalues  and  eigenvectors  of 
transfer  function  matrices  [33]. 

It  is  instructive  to  view  the  relation  between  changes  in  the 
singular  vectors  and  changes  in  phase  difference  in  terms  of  the  decomposition 
of  the  tangent  space.  Let  y(t)  be  a  curve  in  the  complex  plane,  and  let 
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and  Xu  be  the  tangent  vectors  to  the  corresponding  paths  through  <IPn  ^ 
traversed  by  the  left  and  right  singular  vectors.  Furthermore,  let  these 
vectors  be  decomposed  into  vertical  and  horizontal  components  as 
X^  *  X^  +  X*|  and  Xu  ■  X^  +  X^,  respectively.  Evaluating  the  differential 
form  (12.1)  on  the  tangent  vector  to  y  yields  the  following  string  of 
equalities : 


,  r  31ogq  dx  9  logo  dy,  _  j_ 
^ 1  3y  dt  dt  a 


.  ,  HdH  , 
Im[v  u] 


H  dv  H  du 

m  v  *T“  —  U 

dt  dt 

-  V’Ov^.V  +  J  d0V(Xv> 


-  VV’.V  -  i  d9„<V 

H  V  H  V 

=  p  dp  (tt  .X  )  -  p  dp  (tt.X  ) 

+  [pHdp  (ir  ,XH)  +  j  d0  (XH)  ] 

-  [pHdp  (tt ,XH)  +  j  d0  (x11)  ] 

L  u  u  *  u  u  u 

+  j  d0  (XV)  -  j  d0  (Xv)  .  (12.36) 

J  V  v  u  u 

Recalling  the  discussion  of  Equation  (12.21),  it  is  seen  that  the  first  two 
terms  on  the  right-hand  side  of  (12.36)  equal  zero.  Moreover,  recalling  the 
discussion  of  Equation  (12.22),  the  second  two  terms  are  also  seen  to  equal 
zero.  Thus,  (12.12)  reduces  to 
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-jr-  H2S£  *c  +  n££o  dxj  ,  i  ,  H  j 
3y  dt  3x  dt J  0  1  dt  J 

H  dv  H  du 
V  dt  “  U  dt 

■  Jtdev(xj>  “  d0u(x][)]  .  (12.37) 

Equation  (12.37)  shows  quite  clearly  that  the  form  j[-  dx  +  dy] 

falls  to  completely  describe  the  behavior  of  the  phase  difference  A0.  In 
particular,  this  form  provides  no  information  about  the  effect  motion  of  the 
singular  subspaces  has  upon  phase  difference.  This  effect  is  determined  by 
the  horizontal  components  of  the  tangent  vectors  and  is  given  by 

-  c”d»u("*^))  •  (12.38) 

To  summarize,  the  vertical  components  of  the  tangent  vectors 
produce  changes  in  phase  difference  entirely  analogous  to  the  scalar  case. 

This  is  because  these  components  do  not  affect  the  singular  subspaces.  Note 
also  that  the  effect  the  vertical  components  have  upon  phase  difference  is 
completely  determined  by  the  behavior  of  the  singular  value.  Hence,  the 
resulting  change  in  Ad  does  not  depend  upon  which  components  of  the  singular 
vectors  are  used  to  measure  phase  difference;  i.e.,  upon  the  choice  of  local 
trivlalizatlon  in  the  discussion  preceding  (12.14).  Another  way  of  stating 
this  is  that  the  vertical  components  produce  equal  changes  in  the  phase 
difference  between  any  two  components  of  the  singular  vectors.  The  horizontal 
components,  on  the  other  hand,  which  are  not  present  in  the  scalar  case  (or 
are  identically  zero  if  the  singular  subspaces  are  constant)  are  responsible 
for  aeJ  not  being  analytic,  and  thus  for  the  failure  of  the  scalar  gain-phase 


relations  to  describe  the  behavior  of  this  function.  An  argument  similar  to 
that  preceding  (12.25)  shows  that  the  horizontal  components  may  be  interpreted 


as  causing  phase  tc  be  transferred  among  the  components  of  the  singular  vectors 
subject  to  the  constraints 


1  + 


=  j 

d@  (XH)  +  j  Z  r]  da,  (XH) 
v  v  J  „  iv  iv  v' 

i-2 

-  0 

(12.39a) 

j  d9  (XH)  +  pHdp 
J  u  u  u  u  *  u 

-  j  d0  (XH)  +  j  Z  r]  da,  (XH) 

J  u  u  J  iu  iu  u 

-  0  .  (12.39b) 


The  horizontal  components  of  the  tangent  vectors  are  thus  apparently 
responsible  for  multiple  loop  system  phenomena  having  no  analogue  in  scalar 
systems. 

Note  that  if  the  basis  {z^}  is  that  given  by  the  standard  basis 
vectors  {e^},  then  the  coordinates  r^v  and  r^u  may  be  reparameterized  by 
<J>iv  and  <}>iu  as  in  Chapter  10.  Similarly,  the  coordinates  and  may 
be  replaced  by  and  and  A0  becomes  9^.  (Note,  the  subscript  "1" 

refers  to  the  components  of  v  and  u  used  to  define  phase  difference,  and  not 
to  any  ordering  of  the  singular  values.) 

Now  recall  that  the  standard  basis  vectors  correspond  to  the 
physical  loops  of  the  system.  The  above  discussion  shows  that  it  is 
possible  to  Interpret  the  failure  of  oe  to  satisfy  the  scalar  Bode 
gain-phase  relations  as  being  due  to  phase  lead  or  lag  being  transferred 


to  the  first  loop  of  the  system  from  other  loops.  Moreover,  transfer  of  phase 
lead  or  lag  must  necessarily  be  present  if  the  gain-phase  relations  are  to  be 
violated.  The  additional  lead  or  lag  produced  in  6^  via  loop  interactions  must 
be  accompanied  by  lag  or  lead  being  produced  in  loops  of  the  system  other  than 
the  first.  Obviously,  this  discussion  may  easily  be  modified  to  apply  when 
the  ith  components  of  v  and  u  (written  in  the  standard  basis)  are  used  to 
measure  the  phase  difference  0^. 

Note  how  coupling  between  loops  manifests  itself  in  the  integral 

relation  (12.10).  The  value  of  A0(ju)o)  deviates  from  the  value  it  would  attain 

if  the  singular  subspaces  were  constant  as  a  result  of  phase  being  transferred 

among  loops  of  the  system.  The  net  effect  of  this  phase  transfer  upon  A0(jwo) 

is  obtained  by  averaging  the  value  of  the  effect  contributed,  by  integrating  the 

H  ^v  H  ^u 

function  governing  phase  transfer,  p  -r—  -  p  -r— ,  along  each  ray  extending 

v  dir  u  dr 

from  juo  to  the  point  at  infinity. 

The  discussion  of  the  preceding  two  paragraphs  was  based  upon  local 
trivlalizations  using  constant  vectors  to  define  phase.  As  discussed  in 
earlier  chapters,  a  more  physically  meaningful  definition  of  phase  utilized 
the  right  singular  vectors,  which  are  not  constant.  In  the  examples  discussed 
in  this  thesis,  this  fact  presents  no  difficulty  since  the  right  singular 
subspaces  are  approximately  constant  and  equal  to  the  standard  basis  directions 
Thus  the  above  analysis  can  be  applied.  In  general,  this  will  not  be  true. 

Thus  the  above  discussion  needs  to  be  extended  to  the  general  case,  although 
this  will  not  be  pursued  in  this  thesis.  Instead  of  coupling  between  physical 
loops  of  the  system,  as  in  the  case  when  the  standard  basis  vectors  are  used 
to  define  phase,  coupling  between  input  and  output  directions  given  by  the 
singular  vectors  will  appear  in  the  equations. 


•Mi 


One  way  of  transforming  the  differential  equations  involving  the 
„  H 

phases  0^  into  those  involving  8  «  v  is  to  use  the  functions  g^^  as  in 
(11.51)-(11.52) .  Since  the  two  definitions  of  phase  difference  are  related 
by  an  exact  form,  however,  it  is  simpler  to  just  use  g^,  given  by  (10.41), 
directly  in  the  integral  relation  (12.10).  To  illustrate,  consider  (12.10) 
in  coordinates  for  the  case  n-2.  Using  coordinates  on  the  ith  local  section 
(10.17)  to  write  the  ith  pair  of  singular  vectors  in  coordinates  yields 


®i(^o>  -  v°>  -  7  /  {iog  coth  4^}dv 


(-!) 


7~  /  (sin  *v  *IF‘sin  ^u  -aF)drAda 
CRHF 


(12.40) 


Using  the  function  gi  in  coordinates  (10.43)  yields 

9  (jtu  )  *  9i  (ju  )  +  Mcos  <(>  cos  <t>  +  sin  $  sin  4>  e 
J  o  1  J  o  u  V  u  V 


S  <  VV 


and 


o 

(12.41a) 


*u) 

0(jwQ)  *  ®2^aJo^  +  Mcos  <f>u  cos  4>v  e  +  sin  <]>..  sin  4>„] 


u 


o 

(12.41b) 


Finally,  recall  that  the  scalar  Bode  gain-phase  relations  contained 


a  weighting  function  showing  that  the  dependence  of  phase  upon  gain  decreased 
rapidly  away  from  the  frequency  in  question.  Note  the  same  weighting  function 
appears  in  the  first  integral  in  (12.10). 


It  will  now  be  shown  that  the  second  integral  in  (12.10)  impli¬ 


citly  contains  a  weighting  function.  This  follows  since  the  element  of 
surface  area  in  the  right  half  plane  is  given  by  rdrAda.  Thus  the  integral 
may  be  rewritten  as 


If  (  H  3pv  H  3pu.  ,  1  r  1  ,  H  3pv  H  3puw  ,  ..  , 

7  /CMP(,>V  17 '%  17 >dr,'dc‘  ■  7  /CRHp  7  <%  -57-%  17 >(rdrAd“> 


(12.42) 


Writing  the  integral  in  this  way  shows  that  the  effect  of  the  motion  of  the 
singular  subspaces  at  a  point  in  the  right  half  plane  upon  the  value  of 
9(juQ)  decreases  as  the  inverse  of  the  distance  from  the  point  to 
Thus  coupling  among  loops  is  most  effective  in  altering  the  value  of  9 ( j ooq) 
if  this  coupling  takes  place  near  the  frequence  ju)Q. 


12.3.  Summary 

In  this  chapter  the  differential  equations  generalizing  the 
Cauchy-Riemann  equations  have  been  translated  into  generalized  gain-phase 
relations  via  Stokes’  Theorem.  Some  insights  into  implications  for  system 
properties  were  discussed.  Much  work  remains  to  be  done;  this  is  discussed 
in  the  conclusion  section  of  Chapter  13. 


CHAPTER  13 


ANALYSIS  OF  AN  EXAMPLE  USING  NEW  INTEGRAL  RELATIONS 


13.1.  Introduction 

la  SecClon  8.4  an  example  of  a  two-input  two-output  feedback  system 
was  discussed.  This  example  exhibited  properties  markedly  different  from 
those  of  a  scalar  feedback  system.  In  particular,  it  was  found  that  the  gain 
in  one  loop  could  be  rolled  off  at  40  db/decade  near  crossover  while  the 
sensitivity  function  remained  bounded  less  than  10  db.  In  addition,  a  new 
tradeoff  between  system  properties  in  different  frequency  ranges  was  observed. 
It  appeared  that  the  anomalous  behavior  of  this  example  was  due  to  some  sort 
of  interaction  between  the  loops  of  the  system,  although  no  theoretical  basis 
for  this  hypothesis  was  available.  The  purpose  of  this  chapter  is  to  provide 
an  extensive  analysis  of  this  example  using  the  integral  relation  developed  in 
Theorem  12.2.  By  identifying  parameters  describing  the  singular  value 
decomposition  of  L(s)  with  the  individual  components  L^ (s)  the  notion  of 
helpful  loop  interaction  will  be  quantified.  A  tentative  explanation  of  the 
behavior  of  this  example  will  be  presented. 


13.2.  Preliminary  Analysis  of  Example 

In  this  chapter  the  feedback  system  with  open  loop  transfer  function 


150  x  1CT 
(s  +  10) 

-7071  x  103(s  +  .1 


(s  +  10) 


(s  +  .01)' 


(s+  .01) 


(13.1) 
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will  be  studied.  It  may  be  verified  that  the  feedback  system  with  sensitivity 
function  S(s)  *  1/(1+L(s))  is  stable. 

The  gain  in  the  second  loop  of  this  system  (which  is  approximately 

_ -1  _ _ I _ 1 _  I  \  i  _  1 _ _ . _ f  .  _  rtl _ 1  l _  _ I  _ r. _ _  I _ .  *  . _ 


equal  to  | - ~|)  *-s  large  until  id  ■  .01  rad/sec,  and  thereafter  decreases  £?| 

(s+.oi) 

with  a  two-pole  roll-off  (*»  40  db/decade) .  Note  the  gain  in  the  second  loop 
is  approximately  equal  to  one  at  o>  *  1  rad/sec.  The  gain  in  the  first  loop  is 
large  until  u>  *  10  rad/sec  and  thereafter  decreases  with  a  one-pole  roll-off 
(®=  20  db/decade).  In  Section  8.4  several  properties  of  L(s)  and  the  associated 
sensitivity  function  were  noted.  These  will  now  be  examined  in  detail. 

First,  denote  the  columns  of  L(s)  by 


‘Hi  Hi 


11 

L12 

21 

L22 

(13.2) 


Let  the  singular  value  decomposition  of  L  be  denoted  by 
L  -  VIU*1 

■  Ivi  :  h1  pi  0 1  ‘ui  i  h1" 

0 


V11  v12  °1  0  “ll  u12 


V  V 

21  22 


0  a2  u21  u22 


(13.3)  > 


From  the  min-max  property  of  singular  values  it  follows  that,  at 


each  value  of  s. 


$ 


a[L(s)]  >  14(8)1 


(13.4a) 


0[L(s)]  <  | L2 (s) | 


(13.4b) 


It  can  readily  be  verified  that  |L^(s)|  >  |L,2(s)|  for  all  s  in  the  closed 
right  half  plane  (CRHP) .  Thus  the  singular  values  of  L(s)  are  distinct  in 
the  CRHP  and  may  be  numbered  o.  ■  o,  a.  *  a.  Since 


LUi  ■ 


(13.5) 


it  follows  that 


°1V1  -  U11  *  L1  +  U21  *  L2 


(13.6a) 


°2V2  *  U12  '  L1  +  U22  ‘  L2 


(13.6b) 


From  (13.6a)  it  follows  that 


al  -  Kl1  *  iLlS  +  iu21*  *  lL2l 


<  luii!  *  flLJ  +  Sl2I 


(13.7) 


Together  (13.4a)  and  (.13.7)  imply 

,  Hal  < . 

1  ‘  1 41  - 1  ll! 


(13.8) 


Since  |L2(s)jj  <  |L1(s)J  in  the  CRHP,  it  follows  that  (u^l  >  0  there  and, 


since  U  is  unitary,  |u. 


uH  |  .  Thus  Vs  €  CRHP  ui(s)  may  be  written 


using  the  local  trivialization  4  from  (10.26).  (Note  the  correspondence 
between  singular  values  and  the  standard  basis  directions  implies  that 


numbered  subscripts  in  this  section  refer  both  to  the  ordering  of  the  singular 
values  and  to  the  standard  basis  vectors  used  to  construct  the  trivializations . ) 
Next,  (13.6a)  yields 


°1V21  “  U11L11  +  U21L12 


lvii!  2  7. 


luii|.|Lnl  -  |u21I-|l12| 


(13.9) 

(13.10) 


It  may  readily  be  verified  that  |L^|  >  |li2|,  VsSCRHP,  and  that 

1 L«l  t  j 

(1  -  77— -)  >1/2,  VsScRHP.  The  latter  fact  implies  that  |u, ,  I  >1/2, 

9  *-^0  II 

which,  since  U  is  unitary,  implies  in  turn  that  |u^.  |  >  |u21|,  vs6CRHP. 

It  follows  from  these  bounds  and  (13.10)  that  lv2jl  >  0-  Hence,  VsGCRHP, 
vi(s)  may  also  be  written  using  the  local  trivialization 

The  preceding  remarks  show  that  for  all  s€CRHP  the  matrix  L(s)  may 
be  assigned  the  coordinates 


L  -  [0 


!  0  ] 

V1  V2 


01  6 
0 


0 


°2  6 


j02 


[P 


U1  !  V 


(13.11) 


or 


cos*  -sin$  e 

V  V 

^  0  - 

sin<f>v  e^v  cos<>v 

0  a2  ej^2 

(13.12) 


cosc 


u 


-sintfc  e  "i  ** 
u 


sin*  e^u  cos$ 
u  u 


The  properties  of  L(s)  used  to  guarantee  the  existence  of  the 
parameterization  (13.12)  may  also  be  used  to  approximate  the  values  of 


303 


these  parameters.  In  particular,  such  an  approximation  in  the  frequency 


range  over  which  ^■^>><^2  be  of  interest. 


First,  from  (13.8)  it  follows  that  at  frequencies  for  which 

IL2l 

<<1.  the  angle 


<f>  *  0 

u 


(13.13) 


Moreover,  at  such  frequencies  cos  ifc^58!,  so  that  (13.6a),  (13.11),  and 


(13.12)  imply 


a.  e^l  o  ■  a,  e 

1  v.  1 


J®1  r 


cos4) 


sin<}>v  e^v 


*  U 


(13.14) 


From  (13.14)  it  follows  that 


°i  *  lLi! 


ei  *  1  Ln 


()  58  arctan  . 

v  i  L 


21 1 


11 


K  *  *L21  -*Ln 


(13.15a) 
(13.15b) 
(13.15c) 
(13 . 15d) 


Together  (13.13)  and  (13.15)  show  that  five  of  the  eight  coordinates  in 
(13.12)  may  be  readily  approximated  in  terms  of  the  component  transfer 
functions  of  L(s) .  Since  is  discontinuous  at  ^  ■  0,  it  is  not  possible 


to  approximate  this  parameter.  This  fact  presents  no  difficulty,  however, 
since  ^  has  little  physical  significance  when  ®  58 0. 


Now  recall  that  (13.4b)  shows  that  <  ll^j.  It  is  not  possible, 
however,  to  use  (13.6b)  to  approximate  O2  as  (13.6a)  was  used  to  show 
S’  jjL-J*  This  follows  since  the  first  term  on  the  right  hand  side  of 
(13.6b)  is  the  product  of  a  small  number,  lu^l*  and  a  vector  of  large 
magnitude.  Meanwhile,  the  second  term  is  the  product  of  a  number  approx¬ 
imately  equal  to  one  and  a  vector  whose  magnitude  is  also  approximately 
equal  to  one  at  frequencies  of  interest.  Thus  the  left  singular  vector  v ^ 
(corresponding  to  choosing  the  right  singular  vector  U£  =  [-sin<J>u  e  J  uj  cos^  ]  ) 
cannot  be  approximated  from  (13.6b).  The  subspace  in  which  v ^  lies  may  be 
approximated  from  (13.6a)  and  the  fact  that  v^  and  v^  are  orthogonal.  The 

phase  difference  02  still  cannot  be  approximated,  however,  since  the  condition 
H 

v^  V2  =  0  does  not  constrain  this  parameter. 

The  inability  to  approximate  §2  is  inconvenient,  as  it  was  shown  in 
Section  8.4  that  feedback  properties  depend  critically  on  its  value.  Moreover, 
this  parameter  behaves  in  unusual  ways  which  are  related  to  the  anomalous 
properties  of  the  sensitivity  function  of  this  system.  Thus  it  would  be 
very  desirable  to  be  able  to  predict,  a  priori,  what  the  value  of  §2  would 
be.  This,  of  course,  could  be  of  potential  use  in  design. 

First,  note  that  Equations  (13.13)  and  (13.15)  may  be  used  to 
determine  how  changes  in  the  functions  L„  (s)  affect  the  various  parameters. 
Recall  that  the  Bode  gain-phase  relations  for  scalar  feedback  systems  show  that 
increasing  the  gain  over  a  frequency  interval  necessarily  causes  positive 
phase  lead,  (e.g.,  see  the  Bode  plot  of  a  phase  lead  filter  as  in  [57,  p.  297]). 
Equations  (13.15a)  and  (13.15b)  show  that  the  gain  in  the  first  loop  of  the 
system  (hence  a. [L])  may  be  increased  without  causing  0.  to  change.  This  may 


over  an  interval  while  leaving 


be  done  simply  by  increasing 
| Lii ( j u>)  |  constant.  From  (13.15c)  it  follows  that  $  will  increase. 

Moreover,  since  increasing  ll^Cjw)!  causes  ^I^Qui)  to  increase  while 
holding  |L^^(jo))|  constant  leaves  ^  L^^(joj)  unchanged,  the  difference 
in  phase  between  I^Qw)  and  will  increase.  By  (13.15d)  it 

follows  that  will  increase.  Thus  an  increase  in  the  gain  is 
accompanied  by  phase  lead,  but  not  necessarily  in  all  components  of  the 
output.  As  just  shown,  it  is  possible  to  increase  the  gain  in  one  loop 
of  the  system  by  increasing  only  the  magnitude  of  that  component  of  the  output 
being  fed  into  the  other  loop.  Thus  phase  lead  is  produced,  but  only  in  one 
component  of  the  output.  The  value  of  phase  in  the  other  component  is 
unaffected. 

ISil 

Now  increasing  the  ratio  — - ,  which  causes  <P  to  increase,  also 

In  I 

affects  the  direction  of  outputs  due  to  inputs  in  the  direction  of  the 
singular  vector  u2-  This  follows  since 

r-sin$  e'j'^v  “ 

Lu2  =  a 2  eje2  v  (13.16) 

cos$ 

V 


Increasing  $  thus  leaves  the  value  of  the  gain  a ^  unchanged;  however,  the 
magnitude  of  that  component  of  the  output  fed  into  the  first  loop  of  the 
system  increases  while  that  of  the  other  component-  decreases .  In  addition, 
the  phase  of  the  first  component  lags  that  of  the  second  by  an  amount  tt  -  j/^. 

The  preceding  discussion  showed  that  the  lead  filter  present  in 


L2i (s)  can  be  associated  with  various  changes  in  the  parameters  of  (13.12). 
What  effect,  if  any,  this  has  on  02  is  still  unclear,  since  an  approximation 
to  0O  cannot  be  obtained.  If  a ^  2  were  analytic,  however,  this  problem 


would  not  exist.  The  Bode  gain-phase  relations  would  show  that  62  would 
approach  -90N°,  where  20N°db/decade  is  the  rate  at  which  02  rolls 'off. 

The  claims  of  the  preceding  discussion  may  be  substantiated  in  the 
present  example  by  examining  plots  of  the  various  parameters.  For  each 
reference.  Figures  8.1-8. 7  are  reproduced  as  Figures  13.1-13.7.  Figures 
13.8.a-13.11.b  are  Bode  plots  of  L^(s),  L^Cs)*  I^Cs)  and  I^Cs), 
respectively.  Figure  13.1  shows  how  the  singular  values  of  L  may  indeed  be 
identified  with  the  gain  in  the  two  physical  loops  of  the  system,  (i.e.,  the 
magnitudes  of  the  columns  of  L) .  Figure  13.5  shows  that  the  right  singular 
subspaces  do  indeed  become  aligned  with  the  standard  basis  directions  (which 
in  turn  correspond  to  the  physical  loops) .  Note  the  correlation  between  the 
plot  of  a2  and  the  Bode  plots  of  and  1^2  in  FiSures  13,9  and  13.11. 

Similarly,  note  the  relation  between  0 ^  and  the  functions  L-^Cs)  and  L2i(s)  ,  as 
shown  by  Figures  13.8  and  13.10.  Note,  in  particular,  the  peak  in  a,  'Figure 
13.1)  due  to  the  peak  in  L^.  As  the  approximations  discussed  earlier  show, 
this  means  of  obtaining  a  peak  in  should  leave  the  phase  difference  the  same 
as  if  6^  actually  equalled  That  this  is  indeed  the  case  may  be  verified 

from  Figures  13.7  and  13.8.  Moreover,  Figure  13.6  shows  there  is  a  peak  in 
due  to  the  peak  in  j  I^  |  /  |  [ .  The  associated  phase  lead  in  may  also  be 

seen. 

Finally,  the  phase  difference  Q^,  which  could  not  be  approximated,  is 
plotted  along  with  in  Figure  13.7.  It  may  be  seen  that  6^  increases ,  or 
experiences  phase  lead,  near  the  frequency  range  over  which  d  is  increasing. 
This  would  suggest  that  the  phase  lead  in  L^,  which  produces  phase  lead  in  -Jj  , 
also  affects  9?.  In  Section  13.3  this  effect  will  be  analyzed  quantitatively 
using  the  generalization  of  the  Bode  gain-phase  relations  presented  in 
Chapter  12. 
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Figure  13.1.  Singular  values  and  magnitudes  of  eigenvalues  -  open 
loop  system. 


Frequency  (rad/sec) 

Measures  of  phase  difference  between  singular 
vectors  and  phases  of  eigenvalues  -  open  loop 
system. 


Decibels 


-7071x10s  x(s  + 0.1) 
(s  +10)5 


322 


Finally,  recall  that  the  analysis  of  Chapter  8  showed  that 
the  angle  <j>  between  singular  subspaces  and  the  phase  differences  0^  “  *  u^ 
are  important  in  determining  feedback  properties.  For  the  example 
being  analyzed,  it  was  pointed  out  in  Section  8.4  that  ant*  80 

that  the  latter  parameters  may  be  used  reliably  in  their  place.  This  may 
also  be  seen  using  the  local  trivializations .  From 


H  H  je2 

U2  V2  =  pu2  pv2  6 

it  follows  that 


COS0  =  p  0  i 
1  11  V 

2  v2 


I  s±n<p  sin<j>  eJ  v  u  +  cos$  cos<)> 


Similarly,  02  and  02  are  related  by  the  function  (10.41) 


(13.17) 


e2  -  e2  +  *s2 


=  0~  +  *fcos  p  COS  $ 

2  u  v 


_  -i  C  VI/  _u;  } 

$  e  J  v  u'  +  sin  <f>  sin  4>  ].  (13.18) 

V  U  V 


Now,  since  it  follows  from  (13.17)  and  (13.18)  that  $*•<!>  and 

e2-e2. 


13.3.  Analysis  Using  New  Gain-Phase  Relations 

In  the  preceding  section  approximations  to  some  parameters  describing 
the  singular  value  decomposition  of  L(s)  were  described.  These  approximations  j- 
were  based  upon  the  algebraic  relation  between  the  various  parameters  and  the 


■:-*j 


components  (s) .  Algebraic  relations  are  based  upon  equations  which  must  be 
satisfied  at  each  value  of  frequency.  Analytic  relations,  such  as  the  classical 
Bode  relations,  relate  the  value  of  a  parameter  at  one  frequency  to  values  of 
other  parameters  over  the  entire  range  of  frequencies.  For  example,  this 
accounts  for  the  fact  that  the  phase  lead  in  the  element  is  necessarily 
accompanied  by  an  increase  in  the  gain  of  that  element. 

Recall  that  an  algebraic  approximation  could  not  be  obtained  for 
§2*  In  this  section,  the  integral  relations  from  Chapter  12  will  be  used  to 
show  that  the  value  of  is  completely  determined  by  the  singular  value 

a ^  and  by  the  left  and  right  singular  subspaces.  Since  these  latter  quantities 
can  be  approximated  algebraically,  in  principal  it  should  be  possible  to 
estimate  02  via  its  analytic  dependence  upon  a2  and  the  subspaces.  Rules  of 
thumb  based  upon  the  new  integral  relations  are  not  yet  available;  thus  it  is 
not  possible  at  the  present  time  to  estimate  directly  without  numerically 
evaluating  a  surface  integral.  It  is  possible,  however,  to  obtain  an  indirect 
estimate.  In  addition,  some  qualitative  insights  into  the  system  behavior 
needed  to  produce  the  observed  phase  lead  in  §2  are  obtained. 

In  order  to  apply  Theorem  12.2  certain  conditions  must  be  satisfied. 
The  matrix  L(s)  obviously  has  no  poles  in  the  CRHP;  it  may  easily  be  verified 
that  det[L(s)]  is  nonzero  there  also.  It  was  shown  in  Section  13.2  that  the 
singular  values  of  L(s)  are  distinct  and  that  each  pair  of  singular  vectors 
u^,  v^  may  be  written  using  the  local  trivialization  T^.  In  addition,  the 

constants  k^  are  given  by  k^  *  1,  k2  ■  2  and  the  singular  vectors  may  be  chosen 

Isl  | s I  2 

so  that  u^  +  e^,  U2-*e2,  v^-j1-  and  v2"*’"2—  e2" 

s 


Thus  all  conditions 


necessary  Co  apply  Theorem  12.2  are  satisfied.  The  phase  differences  6^ 
must  therefore  satisfy  (noting  0^(0)  *  0) 


.  -  dlogo  I  | 

"  7  /  “dT  {l0g  COth  2  }dv 

—  CD 

3p„ 


+  £/  (P..  H4^-P..  H  ~T“)drAda 


CRHP  V  i  3r  Ui  3r 


using  a  polar  coordinate  system  defined  by  s  ■  j(jjQ  +  re 


jet 


CP  ,  (13.19)  reduces  to 


oo  dlogo 

Mj“0)  *  7  /  —  {log  coth 

—  00 


1=11 


CRHP 


-  -  dip 

(sin  ♦vir-  sin  *u  ir)drAdo 


(13.19) 


In  coordinates  on 


(13.20) 


The  first  integral  in  (13. 19) -(13. 20)  is  similar  to  the  scalar 
Bode  gain-phase  relation.  Associated  with  this  integral  is  the  rule  of 
thumb  that  a  20N  db/decade  roll-off  rate  produces  a  90N°  phase  lag.  The 
second  term  in  (13.19)-(13.20)  quantifies  the  amount  by  which  motion  of  the 
singular  subspaces  causes  the  gain-phase  relations  to  be  violated.  In  this 
example  it  is  clear  that  the  violation  of  the  gain-phase  relations  is  due 
to  interactions  between  the  physical  loops  of  the  system.  To  see  this,  recall 
the  fact  that  $  0  implied  t^at  the  right  singular  subspaces  were  aligned 

with  the  standard  input  directions.  Thus  any  contribution  from  the  surface 
integral  must  arise  from  the  left  singular  subspaces  changing  with  frequency. 
This  manifests  itself  in  changing  loop  interactions  due  to  changes  in  the 
way  outputs  from  the  high  and  low  gain  subsystems  are  fed  back  to  the  input. 

Note  from  (13.20)  that  the  values  of  6^(jl)  and  ^(jl)  should 
deviate  from  those  expected  from  the  SISO  gain-phase  relations  bv  amounts 
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of  equal  magnitude  and  opposite  sign.  Thus  an  alternative  to  computing  the 
surface  integral  directly  would  be  to  estimate  its  value  by  comparing  the 
value  of  0^(jl)  with  that  of  the  phase  of  a  transfer  function  F(s)  which 
satisfies  |F(joj)  |  *  a^(jui)  .  Since  a^(ju>)  | L^(ju) |  ,  F(s)  will  be  constructed 

so  that  j  F(ju))|  *  jL^(joi)|.  Using  spectral  factorization  techniques,  it  may 
be  shown  that 


(150  x  10 


(s  +  10) 


s  +  2)  (s  +  49) 


(13.21) 


is  a  suitable  function.  Inspection  of  the  numerical  values  of  o^(ju>)  and 
| F ( j u>) |  shows  that  the  two  functions  are  very  nearly  equal.  The  Bode  plot 
of  F(s)  is  shown  in  Figure  13.12.  As  expected  from  classical  techniques, 
the  peak  in  |F(jaj)|  near  uj  «*  10  rad/sec  is  accompanied  by  phase  lead  at 
lower  frequencies.  At  u  ■  1  rad/sec,  *  F(jl)  ■  10.6°.  But  9^(jl)*“  5.7°. 
Thus  the  surface  integral  must  contribute  *  -  16.3°  phase  lag,  and  the 
value  of  ©2 ( j  D  should  be  larger  than  that  predicted  by  the  roll  off  rate 
of  (^(ju)  near  w  ■  1  rad/sec  by  ^16.3°.  From  Figure  13.1  it  appears  that 
o 2  has  approximately  a  40  db/decade  roll  off  for  well  over  a  decade  on 
either  side  of  w  ■  1  rad/sec.  Closer  inspection  (Figure  13.13)  reveals  that 
a  more  accurate  estimate  would  be  »  38  db/decade,  while  the  numerical  values 
of  02  indicate  that  very  near  w  *  1  rad/sec,  O2  rolls  off  at  37  db/decade. 
Thus  if  the  surface  integral  was  not  present  ^(jl)  would  be  roughly  -165° 
to  -172°.  The  loop  interaction  term  shows  that  d^fjl)  should  lie  roughly 
in  the  interval  -150°  to  -156°.  This  agrees  well  with  the  actual  value  of 


e2(ji)  *  -154°. 


Figure  13.13.  Smallest  singular  value  vs.  gain  in  second 
loop,  D ^  2  * 


The  integral  relations  may  also  be  used  to  gain  qualitative 
information  about  the  properties  of  the  system  if  loop  coupling  is  to 
produce  phase  lead  in  Oj.  First,  the  value  of  the  integrand  averaged  over 

the  right  half  plane  must  be  negative.  Since  *°  0  due  to  the  structure  of 

3*  ° 

V 

the  example,  it  follows  that  must  take  negative  values  over  at  least  a 

portion  of  the  right  half  plane.  Moreover,  6  must  be  sufficiently  greater 

3ty 

V 

than  zero  in  this  region  if  the  negative  values  of  are  to  have  an 
appreciable  effect.  Finally,  since  the  surface  integral  essentially  weights 
the  integrand  by  a  factor  of  (1/r) ,  it  follows  that  and  <p  must  attain 
the  appropriate  values  in  the  vicinity  of  s*jl.  Since  the  values  of  both 
4>v  and  t 1>  are  determined  by  L^Cs)  ,  it  follows  that  the  coupling  introduced 
by  this  element  is  critical  to  producing  the  desired  phase  lead.  Thus  a  large 
value  of  seems  necessary  to  achieve  the  phase  lead  resulting  in  the  first 
peak  in  the  sensitivity  function  being  only  8  db.  Recall  also  the  large 
value  of  <Jv  results  in  the  second  peak  in  sensitivity  being  as  large  as  8  db. 
Thus  it  is  reasonable  to  conjecture  that  the  tradeoff  between  the  levels  of 
these  two  peaks  cannot  be  avoided  within  the  basic  structure  of  this  problem. 

Finally,  recall  the  discussion  in  Chapter  12  concerning  the  two 
mechanisms  by  which  changes  in  the  phase  difference  parameter  02  may  be 
produced.  Clearly  both  are  demonstrated  in  this  example.  At  low  and  high 
frequencies  the  change  in  §2  is  dominated  by  the  first  mechanism;  i.e.,  the 
fact  that  a2(ju))  is  rolling  off.  In  the  mid-frequency  range,  however,  the 
second  mechanism  has  a  significant  effect.  Recall  this  mechanism  was  due  to 
a  transfer  of  phase  between  the  loops  of  the  system.  This  transfer  is 


clearly  demonstrated  in  this  example,  since  the  phase  lead  in  I^Cs)  does 
not  produce  phase  lead  in  0^  but  rather  in  ©2«  The  interaction  between 
loops  is  present  in  thfe  form  of  a  significantly  large  value  of  <f>  .  Moreover, 
the  fact  that  the  change  in  phase  is  not  equal  in  all  components  of  the  left 
singular  vectors  (choosing  the  right  singular  vectors  as  u^  *  e^)  is 
demonstrated  by  the  significant  change  in  over  the  frequency  interval  in 
which  §2  experiences  phase  lead,  while  0^  does  not.  Clearly,  the  effect  in 
the  phase  difference  parameters  8^  is  contrary  to  what  one  would  expect  were  each 
eJ  1  a  rational  transfer  function. 

13.4.  Conclusions 

In  this  chapter  the  integral  relations  derived  in  Chapter  12  have 
been  used  to  analyze  the  exartple  in  Section  8.4.  The  same  type  of  analysis 
could  be  performed  on  the  realistic  example  of  Section  8.5,  although  the 
nature  of  the  loop  coupling  in  the  latter  example  does  not  produce  any  phase 
lead  in  §2 •  Even  in  the  example  considered,  more  work,  remains  to  be  done. 

The  parameters  of  the  lead  filter  in  I^Cs)  need  to  be  varied  in  order 
to  gain  an  understanding  of  their  relation  with  $v  and  .  Especially 
needed  is  some  knowledge  of  how  these  parameters  are  affected  at  points 
in  the  open  right  half  plane. 

Another  task  remaining  is  to  numerically  evaluate  the 
surface  integral.  By  evaluating  the  integral  in  steps  some  insight  would  be 
gained  into  how  fast  the  weighting  function  decays  the  effect  of  the 
integrand. 


The  effect  of  singularities  also  needs  to  be  studied.  Recall 
Theorem  12.2  is  applicable  only  to  those  matrices  with  distinct  singular 
values  in  the  right  half  plane.  Yet,  by  reducing  the  gain  in  the  first  loop 
by  a  constant,  a  system  with  the  same  phase  lead  in  6^  may  be  obtained  and 
for  which  the  singular  values  cross  over.  This  suggests  the  assumption  of 
distinct  singular  values  might  be  removed. 

Another  observation  was  that  changing  the  sign  of  L„. (s)  did  not 

dip 

affect  the  values  of  c^Cjw),  or  ~%r~‘  ^et  t*ie  P^ase  leading  effect  was 

no  longer  present.  Further  inspection  revealed  that  o^Cs)  was  zero  at  two 

points  in  the  open  right  half  plane.  Thus  the  new  integral  relations,  as 

expected,  exhibit  the  effects  of  nonminimum  phase  plants.  One  difference, 

however,  is  that  the  integral  relation  for  e^9l  is  still  valid;  only 
i  0 

that  for  a2  eJ  2  i3  affected. 

Finally,  the  ability  to  independently  manipulate  <J>v»  <p^,  and 

is  also  constrained  by  a  set  of  differential  equations.  These  were  not 
stated  in  Chapter  11,  but  a  procedure  with  which  to  derive  them  was  presented 
in  Section  11.3.  Further  study  into  these  equations  is  also  needed. 


CHAPTER  14 


SUMMARY  AND  CONCLUSIONS 

For  a  summary  of  the  results  of  this  thesis,  the  reader  is 
referred  to  Chapter  1  and  to  the  concluding  sections  of  each  chapter. 

This  thesis  comprises  an  incomplete*  attempt  to  generalize  some 
ideas  from  classical  control  theory  to  multiple-loop  feedback  systems. 
Significant  aspects  of  the  classical  theory  were  the  quantifications  of 
various  algebraic  and  analytic  tradeoffs  among  feedback  properties. 
Investigating  how  these  tradeoffs  extend  was  a  major  task  of  this  thesis. 

As  even  the  most  casual  reader  must  have  recognized,  much  substantial  work 
remains  to  be  done.  Specific  areas  needing  further  research  could  be  discussed 
here,  but  Bode’s  invincible  fatigue  is  beginning  to  set  in.  Suffice  it  to 
say  that  the  essential  tools  —  the  approximation  techniques  from  Chapter  8 
and  the  mathematical  theory  from  Chap'ters  9-12  —  appear  to  be  at  hand. 


Although  hopefully  not  insignificant 


APPENDIX  A 


PROOFS  OF  THEOREMS  IN  CHAPTER  3 


The  following  lemma  follows  from  a  minor  modification  of  the  well- 

known  Poisson  Integral  Formulas  [26,27]  for  the  recovery  of  a  function 

analytic  in  the  right  half  plane  from  its  values  on  the  imaginary  axis. 

The  proof  found  in  [26]  is  modified  to  show  that  singularities  of  log  f(s) 

at  jco-axis  zeros  of  f(s)  do  not  contribute  to  the  values  of  the  integrals. 

As  the  required  modification  is  slight,  the  proof  will  only  be  sketched. 

Lemma  A.l:  Let  f(s)  be  analytic  and  nonzero  in  the  closed  right  half  plane 

ji 

except  for  possible  zeros  on  the  imaginary  axis.  Assume  that  - r  log  f(s) 

ds1 


is  in  class  R,  i=0,l,...  . 

Then  at  each  point  s  =  x  +  jy  ,  x  >  0  it  follows  that 

o  o  o  o 

oo  X 

log |  f  (s  )  |  =  7  /  log|f(ju))|  —X - - - r  doj  (A.l) 

°  "  —  xZ+(y  -u>r 

o  wo 


*f(sft)-7  /  *f(jw)  -5 - 2 - 2  dw 

-oo  x  +(y  -w) 

o  o 


(A. 2) 


log  f(s) 


s 

o 


i  /  -^  log  £(») 

-00  ds 


S=JCO 


X 

o 


2  2 
X  +(y  —Cl)) 

o  J  o 


do. 


(A. 3) 


Proof :  Define  the  contour  to  be  the  imaginary  axis  traversed  from  +j°° 
to  -j®  with  semicircular  indentations  of  radius  6  into  the  right  half  plane 
at  the  joj-axis  zeros  of  f(s).  Then  f(s)  is  analytic  and  nonzero  to  the  right 
of  C^.  It  follows  that  log  f(s)  and  its  derivatives  are  analytic  in  this 


region  also. 


The  proof  of  the  Poisson  Integral  Formulas  in  [26]  may  be  followed  to  show 


that  for  each  point  sq  in  the  open  right  half  plane  there  exists  6 
sufficiently  small  so  that 


log  f(sQ) 


Tj  /  lo*  £CO 
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d? 


(A.  4) 


A  straightforward  calculation  shows  that  in  the  limit  as  6  -*■  0  the  integrals 
taken  around  the  semicircular  indentations  vanish.  Thus,  setting  jw 
in  (A. 4)  yields 


00 

log  f(s  )  =  7  /  log  f(jui) 


Similarly, 


2  2 
x  +(y  -w) 
o  o 


dm 


(A.  5) 


log  f (s) 
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_  i  t  A  log  f(s) 
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-on  ds 
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2  .  ,2 
x  +(y  -ui) 
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da) 


(A.  6) 


where  the  improper  integrals  are  defined  using  Cauchy  principal  values 

[26,  pp.  203-204].  The  result  follows  by  taking  real  and  imaginary 

parts  of  (A. 5).  ■ 

Proof  of  Theorem  3.1:  Follows  from  Lemma  A.l  by  setting  f(s)  =  S(s)  and 

noting  § (z)  =  Bp^(z) .  ■ 

Proof  of  Theorem  3.2;  Follows  from  Lemma  A.l  by  setting  f(s)a*T(s)  and 

noting  T(p)  =  B  ^(p)e^T.  ■ 

z 

It  should  be  pointed  out  that  formulas  relating  log | f (sq) ]  to  a 
weighted  integral  of  *f(ju>)  and  4f(sQ)  to  a  weighted  integral  of  log|f (jw)  |  may 
also  be  obtained.  These  follow  from  the  conjugate  Poisson  Integral  Formulas 
[26,  pp.  225-226]  and  may  also  have  implications  for  feedback  design.  However, 
such  implications  will  not  be  pursued  further  at  this  time. 
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Theorem  3.3  could  be  verified  directly  using  contour  integration 
in  the  right  half  plane  with  branch  cuts  extending  from  the  zeros  of  S(s) 
to  infinity  deleted.  Care  must  be  taken  to  deal  correctly  with  the  fact 
that  *S(s)  jumps  by  a  multiple  of  2ir  across  these  cuts.  (This  led  to  the 
incorrect  result  in  [9]).  An  alternative  procedure  is  to  prove  Theorem  3.3 
as  a  limiting  case  of  the  integral  relation  (A.l). 

Proof  of  Theorem  3.3:  Evaluating  (A.l)  for  f(s)*S(s)  and  s*x>0  yields 
(noting  conjugate  symmetry  of  S(joi)  implies  S(ju)  ■  S(-joj))  : 

.  <*> 

log|S(x)|  -  /  log j  S(ju)  |  -2X-  2  dm  .  (A. 7) 

0  x  +u» 

2 

Observing  that  log|S(ju>)|  — r — r  converges  to  log |  S ( Juj)  {  pointwise  as  x 

X  +U) 

approaches  infinity  suggests  that 

»  2 

lim  x  log  |  S(x)  |  »  lim  ~  /  log|S(joi)|  •  dw  (A. 8) 

x  -»■  +  •  x  -►  + 00  0  x  +u) 

might  be  used  to  evaluate 

00 

/  log  |  S  (joj)  |  do) 

0 


R 


(A. 9) 


The  proof  of  this  conjecture  consists  of  three  parts.  First, 


it  is  shown  that  (A. 9)  may  be  approximated  arbitrarily  closely  by 
23 

/  log|S(ju>)  | doj  for  S3  sufficiently  large.  Next,  it  is  shown  that  for 
0  n2 

oj6  [0,2]  the  sequence  — ^ j  converges  uniformly  to  1  as  n  approaches 

n  +w 

infinity.  Together  these  facts  show  that  the  limit  on  the  right-hand  side 
of  (A. 8)  is  finite  and  equal  to  (A. 9).  Finally,  the  limit  on  the  left- 
hand  side  of  (A. 8)  is  evaluated. 

The  power  series  expansion  of  log  S(s) =  -log[l+ L(s) ]  for 
|L(s) |  < 1  is  given  by  [58,  p.  158] 


log  S(s)  =  -L(s)  + 


higher  order  terms. 


Thus,  by  (3.22)  <*>  log|s(joi)|  approaches  zero  at  infinity.  This  implies 

that  there  exist  a  frequency  u  and  positive  constants  M  and  6  such 

Mq 

that  |  log  |  S  (joi)  |  |  <  — j-+^-  for  u>  >  o)q  .  For  2  >  o>q  , 

0) 


/  |  log  |  S  (joj)  |  |  do)  <  / 
5  2 


M 

1+6 

(l) 


0 


From  (A. 10)  it  follows  that  for  any  e >  0  there  exists  a  frequency  2  such 
that 

00 

/  log  |  S  ( ju>)  jdu>  <  e  ( i 

2 

2 

On  the  interval  oo£  [0,2]  it  is  easy  to  verify  that  the  sequence  — — j 

n  +w 

converges  uniformly  to  one  as  n  approaches  infinity.  If  L(s)  has  no 
ju-axis  poles,  then  uniform  convergence  of  the  integrand  suffices  to 
show  that  [48,  p.  71] 


I 


i.- 


Illll.il.  IU 


,..y  |ii  U  I  1  •:  *  —•  W  -  •  ~„S 
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0) 


U) 


lia  /  logjS(ju)J  — = — r  dot  ■  /  log | S(jw)  | du 


x^+w^  0 


(A. 12) 


(If  L(s)  has  poles  in  [0,iu] ,  Chen  it  is  necessary  co  consider  indentations 
into  the  right  half  plane  as  in  the  proof  of  the  Lemma;  these  details  are 
omitted.) 

Together  (A.ll)  and  (A.12)  imply  that 

2 


lia  /  log  1SC»!  -j  j  du  -  /  log|SO)fdw 


2  2 

0  x  +tu 


It  remains  to  evaluate  lia  x  log|S(x)|.  By  definition 


log | S (x )  |  «  logfB"1^)!  +  log|S(x) 


N 


"  2  1°8  TIT  +  log|S(x)| 

J 1  P— * 


i-1 


Since  lia  x  log[S(x)|  -  0,  it  follows  chat 


N 

P 

lia  x  log|S(x)|  ■  I  lia  x  log 

i-1 


Pi4* 


pi’x 


From  the  power  series  expansions  [58,  P*  158] 


logd  +  |)  *  f  -  7  (f)2  +  ••• 


ioga  -f)  -  -f  -i  <f>:  + 


<  1 


<  1 


it  follows  chat  for  x  >  |p 


it  4  .  *  ,  *  ,  ^  •  •  ,  •  *  ‘'*.fc,^**",**«**.k  »*’  *  .* 

j— n.  JilL  f.  JTa.  ■■ '  »  ■ 


(A.  13) 


(A. 14) 


(A. 15) 


■  •_*»  -  l-*«  -  1  f-  -  *  -  '«-»  -  •  L  j  >.  ■.  Ua  —  <- 
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V*  (■  *  IT 

^‘1o8V7!I 

X 


Thus, 


11m  x  log 


pi+*| 


L+  higher  order  ceras. 


2  Re[p . ] 


p,-x  * 


(A. 16) 


Subscitucing  (A. 16)  into  (A. 15)  yields 


lim  x  logjs(x) |  -  Z  2  Re[p  J 
x—  1-1  1 


(A. 17) 


Substituting  (A. 17)  into  (A.8)  and  using  (A.13)  yields 


ir  I  Re[p.]  «  /  logjS(j<i>)  |d<u 
1-1  0 


Proof  of  Corollary:  At  each  frequency  w  >  uc 


;scj«) 


1  <  1 

1+L(ju)  -  1-IUju) 


This  follows  since  jL(Ju)j  <  l  for  u  >  w  by  (3.24).  Moreover,  since 


|L(jui)  |  <  <  1  for  w  >  u»c 


S(j«)j  <  - 

1  "  1+k 
u 


Expanding  log  (  ) ia  a  power  series  yields  (for  a  >  w  ) 

\l  ~  jS*/ 


The  infinite  sum  attains  its  maiHmmn  for  u 


.  Define 
c 


(- m-Y 

V  wJ  i+k 

\U»  /  0) 


M  -  Z 
o  ,  n 
n-1 


-  losi 


1  - 


M 


01 


1+k 


u 


1+k 


Thus,  (j  >  ojc  implies  that 


M 


log|S(ju)|  <-jA- 

ui 


Thus, 


•  M 


/  log  |  S  |  da*  <  /  du 


u 


u»_  u 
c 


fcw 


Substituting  Ho: 


M 


ku 


Since 


M 


1+k 


■  e,  this  reduces  to 


log 


/  log|  S(jw)  | du  < 


I£h 


S 

.  « 

a 


APPENDIX  B 


PROPERTIES  OF  THE  SINGULAR  VALUE  DECOMPOSITION 

The  purpose  of  this  Appendix  is  co  state  some  properties  of  the 

singular  value  decomposition  which  are  used  in  this  thesis.  The  exposition 

is  based  upon  that  of  Stewart  [37,  Section  6.6];  the  reader  is  referred  to 

this  source  for  proofs  and  further  details. 
nxn 

Let  A€  C  .  Then  there  exist  unitary  matrices  U  and  V  such  that 

Vs AU  -  I  (B.l) 

where  I  *  diag[o, ,o», . . . ,o  ]  and  a.  >  a~  >...>  a  >0.  The  numbers  o.  are 
l  z  n  l  -  c  —  —  n  -  l 

referred  to  as  the  singular  values  of  A.  The  columns  of  D  =  [u,  !  ...  !  u  ] 

1  i  i  n 

and  V  -  [v.  |  ...  !  v  ]  are  referred  to  as  the  right  and  left  singular  vectors 
i  i  i  n 

respectively.  The  singular  values  and  vectors  must  satisfy  the  equations 

Aui  “  aivi  (B.2) 

vi  A  =  ui  ai  •  (B.3) 

2  H 

Moreover,  the  numbers  are  the  eigenvalues  of  the  matrix  A  A.  The  right 

singular  vectors  are  eigenvectors  of  this  matrix.  Thus, 

AHAU  «  US2  .  (B.4) 

2  H 

In  addition,  the  numbers  are  the  eigenvalues  of  the  matrix  AA  .  The  left 

singular  vectors  are  eigenvectors  of  this  matrix.  Thus, 


The  singular  values  of  A  are  uniquely  defined.  The  singular  vectors, 
H  2 

however,  are  not.  If  A  A  han  an  eigenvalue  a  with  multiplicity  k,  then  the 

corresponding  columns  of  U  may  be  chosen  as  any  orthonormal  basis  for  the 
H  2  2 

eigenspace  of  A  A  corresponding  to  a  .  If  a  >  0  then,  once  the  right  singular 

vectors  are  chosen  in  this  fashion,  the  left  singular  vectors  are  uniquely 

determined  from  (B.2).  (Similar  arguments  starting  with  the  left  singular 

vectors  can,  of  course,  be  made  using  (B.5)  and  (B.3).) 

Note,  in  particular,  that  there  exists  a  degree  of  freedom  in 

choosing  even  a  right  singular  vector  corresponding  to  a  singular  value,  o^, 

of  multiplicity  one.  Let  u^  be  a  right  singular  vector  corresponding  to  ck, 

and  let  vi  be  the  corresponding  left  singular  vector.  Then,  from  (B.2),  it 

follows  that  choosing  u^  •  eJ  u^  ,  a€  ]R,  as  a  new  right  singular  vector  yields 
~  jot 

Vj.  ■  eJ  v^^  as  the  corresponding  new  left  singular  vector.  Analogous 

statements  hold  for  singular  values  of  multiplicity  greater  than  one. 

Let  a  ■  a.. ,  the  largest  singular  value,  and  let  o  =  a  ,  the  smallest 
x  —  n 

singular  value.  Then, 

;-|a|2  cb.6) 

where  | jj  denotes  the  matrix  norm  induced  by  the  standard  Euclidean  vector 
norm.  Assume  detA#0,  so  that  A  ^  exists.  Then, 


The  singular  values  satisfy 


M 


F 


(B.9, 


The  singular  values  of  A  are  related  to  the  eigenvalues  of  A  by 


(B.ll 


Moreover,  if  A  is  normal  (e.g.,  if  A  is  Hermitian  or  unitary),  then  the 
eigenvalues  of  A  may  be  ordered  so  that  =  |X^|. 

Since  the  singular  values  of  A  are  the  nonnegative  square  roots  of 
a  Hermitian  matrix,  they  inherit  many  of  the  interesting  properties  of  such 
matrices.  Among  these  are  the  following  min-max  property: 

h2 

Oj  ■  min  max  — n — b —  din# .  =  n-i+1  .  (B.l. 

1  S4  u£j,  HI 
1  urt1 


The  minimum  in  (B.9)  is  taken  over  all  subspaces  of  fn  with  the  appropriate 
dimension.  In  particular. 


a[A] 


max 

u€  *n 

u^O 


BAui2 

Hz 


a[Aj  > 


Vu€  «n 


(B.l: 


(B. 


In  addition, 


Finally,  one  other  property  which  singular  values  share  with 

9 

eigenvalues  of  Hermitian  matrices  is  that  they  are  numerically  well-conditioned. 
In  fact,  given  A,  AA€  Cn*n,  then 

a [A  +  AA]  >  a [A]  -  a[AA] 

Thus,  if  a[AA]  <  a[A]  and  A  is  nonsingular,  then  A  +  AA  is  also  nonsingular. 

In  fact,  given  A  nonsingular,  the  matrix  AA  of  smallest  magnitude  for  which 
A  +  AA  is  singular  has  magnitude  a[AA]  »  cj[A].  One  such  matrix  is  given  by 
AA  *  -  a  v  u  ,  where  a,  v,  and  u  are  the  smallest  singular  values  and 
corresponding  singular  vectors  of  A. 


APPENDIX  C 


PROOFS  OF  THEOREMS  IN  CHAPTER  5 


Proof  of  Theorem  5.1:  Note  the  sensif  /ity  matrix  S(s)  *  [I  +  L(s)]~^ 
has  no  poles  in  the  closed  right  half  plane  by  assumption  of  closed  loop 
stability.  The  function  det  S(s)  thus  has  no  poles  in  the  closed 
right  half  plane  but  has  zeros  at  the  open  right  half  plane  poles 
{pi  ;  i=l,...,Np}  of  L(s) .  It  is  necessary  to  remove  these  zeros. 
Define 


D(s)  =  B  1(s)  •  det  S(s) 
P 


where  Bp(s)  is  the  Blaschke  product 


N 


Bp(s) 


P  Pt  -  s 


n  - 


i-1  *1 


p,  +s 


( 


0 


Then  D(s)  has  no  zeros  in  the  open  right  half  plane,  log  D(s)  is  analytic 
there,  and  log|D(s)(  =  log | det  S(s)J.  A  little  analysis  reveals  that 


log  det  S(s)  =  -  log  det  [I  +  L(s)] 

*  -  log  [1  +  I  (products  of  L^Cs))] 


0 


The  fact  that  (5.24)  is  satisfied  implies  that  the  £(•)  term  on  the  right- 
hand  side  of  (C.3)  goes  to  zero  as  w-*-®.  In  particular,  there  exists  a 
frequency  such  that  |  E  ( - )  |  <1  for  ui>w^.  At  these  frequencies  (C.3) 
has  the  power  series  expansion 

E2f  , 

log  det  S(s)  *  -  £(•)  +  — ^ —  +  higher  order  terms.  (< 
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Thus,  d)log|det  S(ju>)  |  also  approaches  zero  as  At  this  point  the 

proof  of  Theorem  3.3  may  be  followed  to  yield  the  result. 

Proof  of  Corollary  5.2:  Follows  from  the  fact  that 


det  S(s) 


n  c.[S(jco)] 
i-1 


Note  that  a  result  similar  to  the  Corollary  in  Chapter  3,  Section  3.3  is 
available  in  the  MIMO  use.  Suppose  the  design  constraint 

M 


a [L(jw)  ]  <  <  e<  1,  w  >  uc 

(D 


(C.5) 


is  imposed,  where  k>  0  and 


M 


1  +  k 


e.  Then,  straightforward  manipulations 


O) 


yield,  for  all  u)>o>c; 


o[S(jcj)  ] 


o[S(joj)  ]  < 


a[I  +  LCju)] 
1 

-  1  -  a[L(ju))  ] 
1 


(C.  6) 


1  - 


M 


1  +  k 


The  proof  of  the  Corollary  in  Appendix  A  can  be  followed  to  yield 

“  iogtTTTK 

/  log  ffi[S(j«)]  <  - 1  k-C 

^c 


(C  .7) 


■a 


r.j 


u 4 


APPENDIX  D 


CANONICAL  ANGLES  BETWEEN  SUBSPACES 


First,  some  facts  about  the  angles  between  singular  subspaces  are 
stated  and  used  to  prove  some  identities  involving  the  matrices 
Denote  the  column  spaces  of  and  by  and  respectively.  Then, 


from  (8.1),  U  and  are  k-dimensional  subspaces  of  Cn.  From  [36  and  59, 


Theorem  2.3]  it  follows  that  there  exist  unitary  matrices 


D  -  [0.  i  Uj  and  V  =  [V,  \  V,] 
I  I  L  1  I  * 


such  that  the  columns  of  and  span  1L  and  9^  respectively,  and  so  that 
(assuming  2k  <  n) : 


A  - 


Ul\ 


U  ny 

2  1  u2  2 


u 

1 

1 

1  01 

1 

l 

1 

1 

s 

1 

0 

-s 

1 

I 

1 

1 

1 

1 

1 

1  10 

1 

0 

\ 

0 

1 

1 

1 

0 

Xn-2k_ 

(D.l 


(If  2k >  n,  then  the  identity  block  appears  in  the  upper  left  hand  corner.) 
The  matrices  C  =  diagfcosa^]  and  S  -  diagfsina^],  i=l, — ,k,  contain  the 
cosines  and  sines  of  the  canonical  angles  between  the  subspaces  and  9^. 


The  canonical  angles  between  subspaces  are  characterized  as  follows. 


Definition  D.l:  [36]  Let  U  and  V  be  subspaces  of  Cn  with 
dy  =  dim  li  >  dy  =  dim  t)  >  1 

The  canonical  angles  [0,tt/2]  between  II  and  9  are  defined  for  m=l,. 
by 


(D.2) 

,dV 


cos$ 

m 


I  H 

max  max  (u  v 
u  €  u  v  e  v 


(D.3) 


with  |uj  2  ■  1,  jv|  2  “  1  and  subject  to  the  constraints 
u.Hu  -  0  ,  v.av  =  0  i*l,...,m-l 

i  l 

The  vectors  u  and  v  ,  m=l,...,q  are  called  the  canonical  vectors  of  the 
mm 

pair  of  subspaces.  ■ 


Remarks : 

(1)  The  canonical  angles  are  uniquely  defined;  the  canonical 
vectors,  however,  in  general,  are  not. 

(2)  With  the  above  numbering,  is  the  largest  canonical  angle 

and  is  computed  from  cosb,  *  max  max  u  v  ,  with 

1  u  €  11  u  S  1? 

9  ul  2  "  I V1  2  =  X- 

(3)  The  vectors  {v^;  i*l,...,d^}  form  an  orthonormal  basis  for 
C.  The  vectors  {u^;  i=l,...,d  }  may  be  augmented  with 

dy  -  dy  vectors  so  that  the  expanded  set  {u^;  i=l,...,dy} 
forms  an  orthonormal  basis  for  U.  In  [36]  it  is  claimed 
that  the  condition 


i*j;  i-1,. .  •  .dyj  J-l . d. 


(D.  4) 


holds . 


Note  that  k  of  the  canonical  angles  between  the  subspaces  and  ^ 
are  the  same  as  those  between  11^  and  In  addition  there  are  guaranteed 

to  exist  n-2k  angles  equal  to  zero.  This  follows  since  the  assumption  2k  <  n 
implies  that  the  dimension  of  ^^^2  is  at  ^east  n“2k* 

Since  the  columns  of  and  both  span  U^,  there  exist  unitary 
matrices  P.  such  that 

l 


l"l  !  D2!  ■  I  "21  fPlH  0 


P2e  c( 


(n-k) x(n-k) 


(D.5) 


Similarly  there  exist  such  that 


tVl  i  v2]  “  *1  !  V  fQl  ° 


Q^C 

Q2et( 


(n-k) x (n-k) 


(D.6) 


Equations  (D.l),  (D.5)  and  (D.6)  may  be  used  to  obtain  expressions 

JJ 

for  the  singular  value  decompositions  of  the  matrices  .  Simple 

calculations  yield 


U 

l^V  «  11 


u 

U1  V2 


u  u 

U2  1  2  2 


PlCQi 


P1IS  0]  Q2 


c  o 


0  I 


(D.  7) 


The  singular  value  decomposition  (SVD)  of  U^^V,  is  the  (i,j)-th  block  of  (D.7) 
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The  SVD's  of  can  be  used  to  derive,  through  straightforward 


manipulations,  the  following  identities: 


(D.9) 


"(Ui\)"1(U1Hv2)  "  ^ViHU2)  (V2HU2)-1  (D.8) 

(U^HU^)"1  *  -(V2\)~\V2\)  (D.9) 

(U2Hv2)  +  <U2\)<V1Hu2)(V2Hu2)~1  “  ^V"1  *  (D.10) 

Finally,  it  is  easy  to  verify  that  the  singular  values  of  the  matrix 
products  on  either  side  of  (D.8)  and  (D.9)  are  equal  to  the  tangents  of  the 
canonical  angles. 

Lemma  D.l:  Assume  that  det [U2**V2]  ^  0*  Then 


x  -  u2(v2hd2)"1v2h  -  v1(u1Hv1)"1u1i 


(D. 11) 


Proof :  Note  the  fact  that  det [U2**V2]  1*  0  implies  that  det[U^HV^]  f  0. 
Now 

i  -  JJ2^2\r\E  *  i  -  [v^11  +  v2v2h]u2(v2hu2)"1v2h 

-  i  -  v1(v1hu2)(v2hu2)‘1v2  -  v2v2h 

-  v^11  -  v1(v1hu2)(v2hu2)‘1v2h 


Using  (D.8)  yields 


i  -  u2(v2hu2)_1v2h 


viviH  +  v1(u1hv1)-1(u1hv2)v2h 

V1(U1HV1)"1(U1HV1)V1H  +  v1(U1\)"1(u1Hv2)V2H 
V1(U1HV1)'1U1H  •  " 


“  m  •  •  Vk  ’  «  *  . 


•  »  j  u 

4  <  «■  '  *  *  1 


APPENDIX  E 


PROOFS  OF  THEOREMS  IN  CHAPTER  8 

Proof  of  Lemma  8.2;  Condition  (8.13)  implies  that  >:>  ^  which 

from  (8.7a),  implies  that  A  ^  exists  and 

A-1  =  (I  +  UjV^)'1 

*  I^-OJ^)"1 

Using  this  approximation  in  (8.7b-d)  yields 
B  *  i"1(u1hv1)"1uiHv2z2 
c  *  I  +  -  u2hv1(u1hv1)'1u1hv2]z2 

D  *  U2HV1(U1HV1)'1 
Using  (D.10)  in  (E.3)  yields 

c  85  (v2hu2)"1[v2hu2  +  z2] 

Using  the  above  approximations  to  (A) -(D)  in  (8.6)  yields  (8.14)  : 

§11  *  ^(UiV-1!!  +  U1Hv2Z2(V2HU2  +  E2)"1(V2HU2)(U2HVi)(UiHVi)"1]  . 
Using  (D.9)  in  (E.6)  yields  (8.14a) 
sl2  »  -E-1(U1\)-1U1HV2Z2(V2H02  + 

s,.  *  -(vA,  +  r,)"1(v9Hu,)(u7Hv.)(u.Hv1)"1 


Using  (D.9)  in  (E.8)  yields  (,8.14c) 

S22  -  <y2\  *  . 

Proof  of  Theorem  8.3;  Approximation  (8.15a)  follows  from  (8.14)  by  requiring 
that 

cfE^  »  ^{(u^)"1!!  -  (P^v2n2crfv2  +  Z2)"1(V2HU1)]}  (E.9) 

and 

o[Z1]  »  aUU^)"1  (U1HV2)E2(V2HU2  +  (E*10' 

so  that  «  0  and  S12  »  0.  Using  the  resulting  approximations  to  S  and 
transforming  into  standard  coordinates  yields  (8.15b) 

s  *  u2(v2nu2  +  Z2)“1(V2HU1)U1H  +  V2(V2\  +  i2)’1(v2hu2)u2h  . 

Using  the  identity  U^U^E  +  ^<^2^  =  ^  yields  (8.15a)  and  the  identity 
ViViH  +  V2V2H  -  I  yields  (8.15c). 

Approximation  (8.16a)  follows  from 

T  »  T  -  I  -  U^vJ^J.  +  Z-)_1V_H  . 

app  2  2  2  2  2 

Note  that 

v2hu2(v2hu2  +  e2)-1  +  22(v2nu2  +  z2)-1  -  I 

-  (V2\  +  E2)_1  +  (V2HU2)'1E2(V2HU2  +  E^'1  -  (V^U,,)'1 


(E.ll 


Using  Lemma  D.l 


Tapp  ■  V“lW  +  +  V'V 

which  is  the  desired  result. 

To  obtain  (8.16b)  note 


T 

app 


T 

app 


+ 


+ 


viCui\)~1  [(ui\)viH  +  (u1Hv2)v2H] 

^i^iS*  +  V2(V2HU2)3  (v2hu2)-1z2(v2hu2  +  i2)-1v2h 

V1V1H  +  WV2\  *  V'V 

vi[(ui\)‘1(uiHv2)  +  (V1HU2)(V2HU2)-1Z2(V2HU2  +  z2)_1 


]V 


(E.l 
H 


Using  (E.ll)  in  (E.12)  yields 


Tapp  ’  Vl"  +  V2E2CV2Hu2  +  Z2)'lv 


-la,  H 
2 


+  vi r (U1HV1) "1(U1HV2>  +  (V^KV^)"1  -  CV1HU2)  (E.l 


•  (v^  +  z2)'1]v2h 


Using  (D.8)  in  (E.13)  yields  the  result.  Expression  (8.16c)  is  obtained 
similarly.  ■ 

Proof  of  Theorem  8.1:  First  note  that  (3.9)  implies  (V2^U2  +  Z2)  ^  58  (V2**U2)  * 
Using  this  approximation  in  (8.15a)  yields  (8.10a).  Expressions  (8.10b)  and 
(8.10c)  follow  easily.  Using  (D.ll)  in  (8.10a)  yields  (3.11a)  from  which 
(8.11b)  and  (8.11c)  follow  easily.  ■ 


Proof  of  Lemma  8.4;  It  may  easily  be  verified  that  an  alternate  expression 
for  S  is  given  by  (assuming  E  ^  and  G  ^  exist) : 


-G_1H 


-FG-1  E-1  +  FG_1H 


(E.14) 


where 


E  »  I  + 


f  « (i  + 

G  -  I  +  -  (U^I^CI  +  U2Hv2Z2)~1(U2HviEl) 

H  -  (u1hv2z2)  (I  +  U^Ij)"1 

Condition  (8.19)  implies  that  (I  +  ^2^  2^2^  1  exists  and 
E  *  I 


(E.15a) 


(E.15b) 


(E.15c) 


(E.15d) 


U 

F  **  U  V  l 
2  11 

G  I  + 

H  *  °lHv2!:2 


-  <o1\i:2HD2Hv2£2) 


(E.16a) 
(E. 16b) 


(E.16c) 


(E.16d) 


The  approximations  (E.16)  imply  that  (8.20)  holds, 


Proof  of  Theorem  8.5;  From  (8.20)  it  follows  that  if  a [E 2 3  is  sufficiently 
small,  then  the  appropriate  inverses  exist  and 


su  *  (i  +  u1Hv1z1)' 


S,„  **  0 


(E.17a) 


(E. 17b) 


S21  88  -(U2HV1E1)CI  + 


S  as  T 
&22  1 


From  (E.17)  Che  matrix  S  Is  given  In  standard  coordinates  by 


S  UX(I  +  U1HV1E1)"1U1H  -  u2 <u2HV1Z1)  (I  +  +  U2U2H 


which  implies 


Note  that 


I  -  (I  +  *  O^l*1  +  UiHviZi)-1 


Using  (E.20)  in  (E.19)  yields 


t  *  u1(u1Hv1)z1(i  +  u1Hv1z1)“1  u1h  +  u2(u2hv1e1)(i  +  u1hv1e1)'1u1h 


or 


(E.17c)  •* 
CE.17d) 


(E.  18) 


T  38  U1U1H  "  U1(I  +  Wl^V  +  U2(U2HviZl)(I  +  Ul\Il)"’luiH  *  (E*19) 


(E.20) 


"J 


(E.21)  u 


¥ 


(E. 22) 


from  which  (8.22b)  and  (8.22c)  easily  follow.  To  obtain  (8.21a)  from  (8.22a) 
define 


S  -  I  -  V.E. (I  +  U.  E. ) _1U.H 
app  1  1'  111  1 


app 


(E. 23) 


and  note 


ui\Ei(I  +  uiHviei)"1  +  C1  +  -  i 

-  EX(I  +  +  cu1Hv1)“1(i  +  U^IjT1  -  (U^)"1 

Sapp  •  1  -  VDlW  +  V1<D1Hvi>'1'1  + 

Using  (D.ll)  in  (E.26)  yields 

Sapp  *  V^V'S"  +  +  niVl>'V  • 


(E.24) 


To  obtain  (8.21b)  note 

Sapp  "  u2(V2Hu2)"1[CV2\)U1H+ CV2Hu2)D2Hl+  [U1(U1Hv1)+U2(U2\)] 
or 

Sapp  *  U2U2H  +  +  V*  +  V2UV2\)-hv2\)  +  (U2\)  (U^) 

•  (I  +  Uj^Z^-1]^11 

Using  (E.24)  in  (E.28)  yields 

Sapp  ■  W  *  h(1  * 

+  U2[(V2HU2)-1(V2HU1)  +  (  u2hv1)(u1hv1)“1 

-  (U2HV1)Z1(I  +  U1HV1Z1)"1]U1H 


From  (D.9)  it  follows  that 


Sapp  '  U2U2H  +  U1(I  +  *  U2(U2HV1,E1(I  +  "iV^'V 


From  which  (8.21b)  follows. 

To  derive  (8.21c)  from  (E.27)  first  note 

(uiHV"1(I  +  ui^i^i)_1  “  I Ci  +  u1Hv1z1)(u1Hv1)]-1 

-  [(u^^d  +  i:1u1Hv1)  ]_1 

-  (i  +  r1u1Hv1)"1(u1\)"1 


Using  (E.29)  in  (E.27)  yields 

Sapp  -  V^V'V  +  V1(I  +  El°lHvi)"1(Ul\)‘luiH 
=  [v1(v1hu2)  +  v2(y2\)W2\)-\« 

+  vxd  +  z1u1Hv1)‘1(u1Hv1)'1[(u1Hv1)v1H  +  (uxhv2)v2h] 

=  v2v2h  +  VX(I  + 

+  v1[(v1hu2)(v2hu2)"1  +  (i  +  z1u1Hv1)'1(u1Hv1)~1(u1Hv2)]v2H 


Using  the  identity 

(I  +  Z^r1  *  I  -  (I  + 

yields 

Sapp  “  V2V2H  +  V1(I  +  Vl'W 

+  V1[(V1HU2)(V2HU2)"1  +  (u1hv1)"1(u1hv2)  -  (I  +  z1u1hVi)'1ZiuiHv2]v; 


t*  V4 
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APPENDIX  F 

COUNTEREXAMPLE  TO  CONJECTURE  THAT  LOG  o  IS  HARMONIC 

Let 

s  0" 

M(s)  -  .  (F.l) 

_2  s_ 


so  that 


From  (F.5) 


Define 

K  £  1 

l  2 

o,  (Y  +  1) 
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Then,  combining  (F.7),  (F.9)  and  (F.ll)  yields 


2  2 

a  logo,  a  logo.  _ 

- +  - _±  ,  K  [2Cy  +  1  +  A  +  1  ) 

ax  ay 


1  +  A  +  1  ,  2  ,  2S 

—  _ ; -  (x  +  y  ) 


fy  +  1 


+  2  +  2  * 
A  +  l 


A  +  l 


-  (X2  +  y2)] 


(F.13) 


V2loga 


^  l  /|  s  j  ^  +  i 


Similarly , 


V  logo 2 


si2  +  1 


(F.14) 


APPENDIX  G 


DIFFERENTIABILITY  OF  SINGULAR  VALUES  AND  VECTORS 

In  this  Appendix,  various  results  concerning  the  differentiability 
properties  of  singular  values  and  vectors  are  presented.  These  results  are 
based  upon  properties  of  eigenvalues  and  eigenvectors  of  Hermitian  matrices; 
these  properties  are  discussed  extensively  in  Kato  [32].  There  is  some 
overlap  between  this  Appendix  and  earlier  results  by  Freudenberg,  Looze  and 
Cruz  [10]  and  MacFarlane  and  Hung  [60].  This  overlap  will  be  pointed  out  in 
the  discussion. 

Singular  values  are  by  definition  real  valued  functions  of  the 

complex  frequency  variable  s  =  x  +  jy.  It  follows  [48  ]  that  they  cannot  be 

even  locally  analytic  in  this  variable.  Recall  one  characterization  of 

analytic  functions  is  the  existence,  at  each  point  in  the  domain  of 

analyticity,  of  a  convergent  power  series  expansion  with  nonzero  radius  of 

convergence  [48  ] .  Lack  of  analyticity  implies  the  nonexistence  of  a  power 

series  expansion  in  s;  however,  it  can  be  shown  that  such  an  expansion 

exists  in  the  two  separate  variables  x  and  y.  This,  in  turn,  shows  that 
/ 

singular  values  are  analytic  in  these  two  variables.  This  fact  was  pointed 
out  by  MacFarlane  and  Hung  [60].  The  results  of  [60]  concerned  only 
analyticity,  however,  and  were  limited  to  the  case  of  distinct  singular 
values.  This  is  inconvenient  when  studying  properties  of  repeated  singular 
values.  In  particular,  the  present  motivation  for  discussing  analyticity 
in  the  two  variables  is  that  analyticity  implies  the  existence  and 
continuity  of  partial  derivatives  of  all  orders.  The  converse  is  not 


9 


true,  however,  and  the  present  results  can  be  used  to  discuss  existence  of 

partial  derivatives  even  for  multiple  singular  values  for  which  no  power 

series  expansion  is  possible.  First  some  definitions  will  be  presented. 

Definition  G.l:  A  function  f  defined  in  a  neighborhood  of  a  point 
2  k 

x=  (x^.x^J^iR  is  of  class  C  at  (x,y)  if  the  kth  order  partial  derivatives 

gkf  k 

-  ,  a, +a-*k,  exist  and  are  continuous  at  x.  Functions  in  c 

1  2 

are  said  to  be  k-times  continuously  differentiable.  ■ 

Definition  G.2  [61,  p.  2]:  A  comp lex- valued  function  f  defined  on  an  open 
2 

subset  d£  C  is  analytic  in  D  if  each  point  w  »  ^wi*w2^e^  has  an  open 
neighborhood  N,  w€n£D,  such  that  the  function  f  has  a  power  series 
expansion 


,(2)  ■  kj.0  cki  (2r“i)k(V“2)l 


(G.l) 


which  converges  for  all  z  -  (z^,Z2)€N.  ■ 

Definition  G.3:  A  complex  valued  function  f  defined  on  a  neighborhood  of 

2 

a  point  w  -  (w^,W2) is  analytic  at  w  if  there  exists  an  open  neighborhood 
N,  w£N  CD,  such  that  the  function  f  has  a  power  series  expansion  (G.l) 
at  w  which  converges  for  all  zGN.  ■ 

Thus,  a  singular  value  is  analytic  at  (xQ,yo)  provided  o.^  has  a 
power  series  expansion 


ai(x,y) 


kj-0  C“ 


(G.2) 


which  converges  for  all  (x,y)  in  a  neighborhood  of  (x0»yQ)  in  ®  •  Note 
that  for  real  values  of  (x,y)  and  (xQ,yo)  the  series  (G.2)  sums  to  the 
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value  of  [M(s *  x  + jy) ] ,  where  M(s)  is  the  matrix  being  studied.  This 
fact  implies  that  the  are  real  numbers. 

Given  the  expansion  (G.2),  the  partial  derivatives  at  (xQ,yo)  with 
respect  to  x  and  y  may  be  determined  from  the  c^*  Even  if  an  expansion  (G.2) 
does  not  exist,  however,  the  partial  derivatives  may  still  exist  and  be 
continuous.  Results  will  also  be  presented  for  the  singular  projections 
and  singular  vectors. 

Definition  G.4:  Let  ^(s)  be  a  right  singular  subspace  of  M(s)  .  (Note 

U^s)  is  not  necessarily  one-dimensional.)  The  orthogonal  projection 

[32,  p . 57 ]  onto  U.(s)  is  denoted  by  P  and  is  termed  the  singular  projection 
i  u± 

onto  the  subspace  11^.  Similarly,  will  denote  the  singular  projection 
onto  the  left  singular  subspace  9^.  ■ 

Theorem  G.5 :  Let  M(s)  be  a  matrix  taking  values  in  CnXn  whose  elements 
are  functions  of  the  complex  variable  s»x  +  jy.  Let  DG  I  be  a  simply 
connected  region  of  the  complex  plane.  Assume  that: 

(i)  For  all  s^D  each  element  (s)  is  analytic  in  s. 

(ii)  The  number  of  distinct  singular  values  is  constant  in  D. 

Note  this  implies  that  the  singular  values  of  M(s)  can  be  ordered  so  that 
a^[M(s)]  >  c^tM^)]  >  ...  >  an[M(s)]  for  all  s£D.  Moreover,  if  two  singular 
values  are  equal  at  any  point  of  D  they  are  equal  Vs  G  d. 

If  (i)  and  (ii)  are  satisfied,  then  at  each  point  so*xo  +  jyQ€D: 

CO 

(1)  If  o^[M(sq)]  t  0,  then  c^  is  of  class  C  •  That  is,  partial 
derivatives  of  all  orders  exist  and  are  continuous. 

(2)  Each  right  and  left  singular  projection  is  of  class  C  . 

Let  condition  (ii)  be  replaced  by: 


(ii)'  The  singular  values  of  M(s)  are  distinct  for  all  s^D. 


b 


K 


E 


IK" 


L  • 
If  ' \ 


If  (i)  and  (ii) '  are  satisfied,  then  at  each  point  so*xQ  +  jyo€D: 

(1) '  If  oi[M(so)]  j4  0,  then  a ^  has  an  expansion  of  the  form 
(G.2)  and  is  thus  analytic  in  the  sense  of  Definition  G.3. 

(2) ’  Each  right  and  left  singular  projection  is  analytic.  ■ 
Proof :  At  each  point  s0*x0  +  jy0SD  it  follows  from  (i)  that  M(s)  has  a 
power  series  expansion 


MCs)  -  Z  M. (s-s  )' 
„  „  l  o 
i-0 


(G.3) 


Now,  consider  M  as  a  function  of  the  two  variables  x  and  y,  where 
these  variables  may  assume  complex  values.  It  will  now  be  shown  that  M(x,y) 
is  analytic  in  (x,y)  at  (x  ,y  ). 

Note  that  (G.3)  shows  M(x,y)  is  continuous  in  x  and  y.  By  Osgood’s 
Lemma  [61,  p.  2]  it  follows  that  if  M(x,y)  is  analytic  in  x  and  y  separately, 
then  M(x,y)  is  analytic  in  (x,y).  First,  setting  y =  yQ  in  (G.3)  yields  a 
power  series  expansion  in  x: 


M(x,y  )  -  I  M. (x-x  ) 
i-0 


(G  .4) 


This  power  series  has  a  nonzero  radius  of  convergence  since:  (a)  the  series 
(G.3)  has  radius  of  convergence  R>0;  (b)  the  series  (G.4)  converges  for 
real  values  of  x  with  |x-x  |  K  R»  (c)  by  Theorem  1.3,  Chapter  III  of 
Conway  [48]  it  follows  that  M(x,yQ)  in  fact  converges  for  all  complex 
values  of  x  with  |x-xQj  <  R.  Thus  M(x,y)  is  analytic  in  x  at  (xo,yQ).  A 
similar  argument  shows  that  M(x,y)  is  analytic  in  y  at  (x0»yQ)*  As  remarked 
above,  Osgood's  Lemma  then  implies  that  M(x,y)  is  analytic  in  (x,y)  at 


V.l 


•• 


--  A 


Define  the  matrix 


M(x,y) 


E  MH(x-jy-s  )A 
i*0 


(G.5) 


Note  that  for  real  values  of  x  and  y 

M(x,y)  *  [M(x,y)]H  =  MT(s)  (G.6) 

ff  II 

where  denotes  complex  conjugation.  Thus,  M  is  not  analytic  in  s, 

yet  M  is  analytic  in  (x,y)  by  the  same  argument  used  for  M(x,y).  This,  of 
course,  is  a  consequence  of  the  definition  of  M;  the  definition  does  not 
involve  taking  the  complex  conjugates  of  x  or  y. 

Next,  the  matrices 

M^x.y)  »  M(x,y)M(x,y)  (G.7) 

and 

(x , y )  =  M(x,y)M(x,y)  (G.8) 


are  also  analytic  in  (x,y) .  Moreover,  the  singular  values  of  M(sq) , 
so  =  xo  +  jyQ,  are  the  nonnegative  square  roots  of  the  eigenvalues  of 

Vx  ,y  )  and  (x  ,y  ) ;  the  right  singular  subspaces  of  M(s  )  are  the 

O  O  L  o  o  o 

eigenspaces  of  and  the  left  singular  subspaces  are  the  eigenspaces  of  M^. 

To  summarize,  at  each  point  s  «x  +jy_eD,  the  matrices  M..  (x,y) 

O  O  O  L 

and  M  (x,y)  are  analytic  functions  of  (x,y)  in  some  neighborhood  N  of 
K 

A  ^ 

(xQ,yo) €  JT .  Note  this  implies  these  matrices  are  C  in  x  and  y.  By  Kato 

and  Mr 

are  also  C°°  functions  of  Cx,y)  in  the  neighborhood  N  of  (xQ,yo) .  Note  the 


[32,  p.  134,  Theorem  5.13a]  the  eigenvalues  and  eigenprojections  of 


eigenprojections  of  are  the  left  singular  projections  of  M  and  the 
eigenprojections  of  are  the  right  singular  projections  of  M.  This 
proves  (2) . 

If  a^[M(sQ)]  i4  0,  then  by  continuity  there  exists  a  neighborhood 
N'  £  N  of  (xQ,yo)  such  that  ai£M<x»y>i  **  0  V(x,y)SN'  £  n.  Since 
o.jMCx.y)]  *  +  / \ ~  [MrCx  .y)l  C“  +  ^itM^^.y)]) ,  it  follows  that  (^[MCx.y)] 

QO 

is  C  in  N'.  This  proves  (1). 

2 

Now,  suppose  that  Cii)'  holds.  Allow  (x,y)  to  take  values  in  C 
2 

rather  than  3R  .  By  continuity,  (ii)'  implies  that  the  eigenvalues  of 
M^x.y)  and  M^x.y)  are  distinct  in  some  complex  neighborhood  of  each  point 
(xQ,yo)  with  xq  and  yQ  real.  Thus,  by  Kato  [32,  p.  134,  Remark  5.13b]  it 
follows  that  the  eigenvalues  and  eigenprojections  of  and  are  analytic 
in  (x,y) .  Conclusions  (1)'  and  (J2) '  follow.  ■ 

The  preceding  theorem  shows  that,  away  from  points  where  the 
number  of  distinct  singular  values  changes,  partial  derivatives  of  all 
orders  of  the  singular  values  exist  and  are  continuous  (in  both  x  and  y) . 

Thus  the  computation  of  the  partial  derivative  of  a  singular  value  which 
was  performed  in  Chapter  9  can  be  rigorously  justified.  The  above  theorem 
also  shows  that  the  orthogonal  projections  onto  the  singular  subspaces 
possess  continuous  partial  derivatives  of  all  orders.  As  pointed  out 
earlier,  the  choice  of  singular  vectors  is  not  unique.  Kato  [32]  and 
MacFarlane  and  Hung  [60]  each  give  a  method  for  constructing  singular  vectors 
First  Kato's  method  will  be  discussed. 
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Let  ll^(x,y)  be  the  right  singular  subspace  of  M(s*x  +  jy) 

corresponding  to  the  singular  value  o^Cx.y).  Assume  that  a satisfies  (i) 

and  (ii)  or  (ii) * .  Then,  there  exists  a  right  singular  projection  Pu^(x,y) 

satisfying  (2)  or  (2)’.  Let  the  columns  of  U . (x  ,y  )  be  a  set  of  right 

loo 

singular  vectors;  i.e.,  an  orthonormal  basis  for  ll^x^.y^.  Then, 

[32,  pp.  110-114],  the  columns  of  U^(x,y)  defined  by 

U±(x,y)  *  W(x,y)Ui(xo,yo) 


W(x,y)  -  [I  -  (P  <x,y)  -  Pu  (*0»y0))  1 


(G.9) 


•  [P  (x,y)P  (x  ,y  )  +  (I  -  P  (x,y)  )  (I  -  P  (x  ,y  )  ] 
1  u^  u^  o  Jo  u^  u_^  o  Jo 

are  a  set  of  right  singular  vectors  spanning  the  right  singular  subspace 
lli(x,y).  Note  this  formula  is  only  guaranteed  to  be  valid  locally;  this 
is  because  the  condition 


(P^Cx.y)  -  Pu.Cx0,y0)|2  <  1 


(G. 10)  £ 


must  be  satisfied.  This  condition  implies  that  the  right  singular  subspace 
ll^(x,y)  can  contain  no  vector  orthogonal  to  the  subspace  lL(xo,yo).  If 
(G.10)  is  satisfied  then  the  square  root  in  (G.9)  is  defined  by  [32,  p.  34] 


where 


(I  "  A) 


*  rn  <-A>° 

n=Q  \  / 


(G.ll) 


^  2)("  2  ~  2 


-  7  -  (n-1) ) 


(G. 12) 


- 
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The  set  of  singular  vectors  (G.9)  is,  of  course,  not  unique;  one  can  multiply 
U^(x,y)  by  any  other  unitary  matrix  which  leaves  the  subspaces  ll^(x,y)  and 
U^(x,y)  invariant.  If  o^fMCx  ,yo)]  0,  then  a  set  of  left  singular  vectors 

can  be  obtained  from 

ViCx,y)  -  -  MCx,y)Ui(x,y)  .  (G.13) 

If  Cii)  is  satisfied,  then  the  singular  vectors  so  defined  are  C°°;  if 
(ii)'  is  satisfied,  the  vectors  are  analytic  in  (x,y) .  This  shows  it  is 
possible  to  choose  these  vectors  to  have  continuous  partial  derivatives  of 
all  orders. 

MacFarlane  and  Hung  [60]  propose  a  method  for  constructing  singular 
vectors  analytic  in  (x,y)  when  the  singular  value  has  multiplicity  one.  The 
construction  is  again  valid  only  locally,  although  this  is  not  pointed  out 

J 

in  [60].  (The  construction  assumes  a  matrix  in  C  has  i- 1  linearly 
independent  rows  and  that  these  are  the  first  £.-1  rows.  This  latter  assumption 
can  be  made,  without  loss  of  generality,  at  a  point,  and  continuity  implies 
it  holds  in  a  neighborhood.  However,  a  more  general  claim  cannot  be  made. 

It  appears  that  the  same  difficulty  is  present  here  as  in  the  construction 
of  Kato.  Namely,  the  singular  subspace  U^(x,y)  may  become  orthogonal  to 
Ui(xo,yo)-  In  Chapter  10  the  ability  to  construct  a  set  of  singular  vectors 
which  form  a  basis  for  a  given  singular  subspace  and  which  are  continuous 
is  studied.  It  is  shown  that,  in  general,  this  is  possible  only  locally; 
thus,  the  deficiency  in  Kato's  and  MacFarlane  and  Hung's  constructions 
cannot  be  overcome.) 


An  alternative  approach  to  the  construction  of  sets  of  singular 
vectors  can  be  based  on  other  results  in  Kato  [32].  These  results  can  be 


used  to  give  singular  vectors  along  a  curve  y(t)  *  x(t)  +  jy(t)  in  the 
complex  plane.  This  construction  is  valid  only  along  a  path  and  not  over 
an  open  set  in  the  plane.  However,  the  construction  has  other  convenient 
features;  no  assumption  similar  to  (G.10)  need  be  made,  and  the  number  of 
distinct  singular  values  is  allowed  to  change. 

While  Theorem  G.5  used  no  special  properties  of  Hermitian  matrices 
the  next  result  does;  these  properties  are  necessary  to  be  able  to  treat 
points  where  the  number  of  distinct  singular  values  changes.  These  results 
were  presented  in  [10]  but  are  repeated  here  for  completeness. 

Theorem  G . 6 i  Let  M(s)  be  a  matrix  taking  values  in  Cn*n  whose  elements  are 
functions  of  the  complex  variable  s  =  x  +  jy.  Let  y(t)  =  x(t)  +  jy(t) , 

"  ”  ^  t^  <  t  <  tj  <  +  ®,  be  a  curve  in  the  complex  plane.  Assume  that: 


(i)  y(t)  is  analytic  in  t;  i.e.,  at  each  value  of  t  £(t..,t..) 

o  i  z 

there  exists  a  power  series  expansion  with  nonzero  radius  of  convergence 


y(t)  *  I  c.  (t-t  )' 
i-0  1  ° 


(ii)  Each  element  (s)  is  analytic  in  s  at  all  points  of  the 
image  of  y(t).  Let  ^^(t^.tj)  and  let  a(tQ)  be  a  nonzero  singular  value 
of  M[y(tQ)]  with  multiplicity  k.  Then  there  exisfs  an  open  neighborhood  N 
such  that  t  6S  C  (t.,t0)  and  analytic  functions  of  t 


w  s  1 


* 


■ 


rV- 


r. 


i 


I 


i. 
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a ^  :  N  C  1R+ 

:  N  C  1+  (n 

:  N  C  ]R+  C°  ,  £-l,...,k 


such  thaC 

(1)  0^(0,  t  £  N,  is  a  singular  value  of  M[y(t)]  with  left 
singular  vector  v^(t)  and  right  singular  vector  u^(t). 

(2)  at(t0)  -a(t0). 

(3)  The  columns  of  the  matrices 


ua(t)  -  [^(0)  |  j  \(°)] 

and 

va(t)  ^  [v^O)  i  •••  j  vk(0)  ] 

are  respectively  orthonormal  bases  for  the  right  and  left  k-dimensional 
singular  subspaces  and  of  M[y(to)]  corresponding  to  a(tQ)  .  ■ 

Note  there  is  no  assumption  that  the  number  of  distinct  singular 
values  is  constant  along  y(.t).  Thus  care  must  be  taken  in  numbering  the 
singular  values  and  vectors  near  exceptional  points,  (i.e.,  points  where 
the  number  of  distinct  singular  values  changes) .  For  this  reason  the 
singular  values  in  Theorem  G.6  are  not  numbered  according  to,  say, 
decreasing  order  of  magnitude.  Note  also  that  there  exist  singular  projections 

P  and  P  which  are  analytic  in  t.  Further  discussion  is  found  in  [10]. 

o  o 


L 


Proof  of  Theorem  G.6:  Assumptions  (i)  and  (ii)  show  that  the  matrix  M  is 
analytic  in  t .  Thus ,  M  can  be  expanding  in  a  power  series 


M(t) 


r 

i-0 


MiCt-t/ 


(G.15 


By  a  construction  similar  to  that  in  the  proof  of  Theorem  G.5  there  exist 
matrices  M^Ct)  and  M^Ct)  which,  for  real  values  of  t,  are  equal  to 
M(t)[M(t)]H  and  [M(t)]**M(t),  respectively.  By  Kato  [32,  p.  139,  Theorem  6.1] 

and  M^(t)  are  analytic  at  each  real  value  of  t. 

Thus  the  nonzero  singular  values  of  M(t)  are  also  analytic  at  real  values 
of  t.  The  result  in  Kato  also  shows  the  singular  projections  are  analytic 
in  t  and  on  page  140  of  [32]  sets  of  basis  vectors  for  the  projections  are 
constructed.  These  basis  vectors  are  analytic  in  t  and  are  obtained  as  the 
solution  of  differential  equations  along  y(t).  One  can  use  this  construction 
to  obtain  a  set  of  right  singular  vectors;  the  left  singular  vectors  are 
then  given  by 

\(t)  *  . 


the  eigenvalues  of  M^(t) 


Thus  singular  vectors  with  the  desired  properties  exist; 
uniqueness  is  not  guaranteed,  of  course.  ■ 

Note  that  derivatives  of  all  orders  of  the  functions  o„,  u„  and  v. 

I  l  I 

with  respect  to  t  exist  and  are  continuous  in  t.  By  letting  y (t)  *  xq + jyQ +  t 
and  y(t)  =  xq + jyQ + j t,  the  partial  derivatives  of  the  singular  values  and 
vectors  with  respect  to  x  and  y  can  be  evaluated  at  s  *  x  +  jy  .  These 


partial  derivatives  are  guaranteed  to  be  continuous  only  along  the  respective 
curve  yCt) ,  however.  In  particular,  mixed  partial  derivatives  do  not 
necessarily  exist.  Of  course,  if  sq  is  not  an  exceptional  point,  (i.e.,  a 
point  where  the  number  of  distinct  singular  values  changes)  continuity  in 
both  x  and  y  and  the  existence  of  mixed  partial  derivatives  of  all  orders 
follows  from  Theorem  G.5.  The  behavior  of  singular  values  and  vectors  near 
exceptional  points  will  need  further  study  if  it  is  desired  to  extend  the 
results  of  this  thesis  (in  particular,  the  Bode  integral  relations)  to 
more  general  situations. 

The  results  of  Theorems  G.5  and  G.6  can  be  used  to  obtain  formulas 
for  the  partial  derivatives,  with  respect  to  x  and  y,  of  the  singular  values 
and  vectors.  These  formulas  were  stated  in  Chapter  9  for  the  case  of 
distinct  singular  values,  but  are  here  derived  rigorously. 

Theorem  G.7 ;  Let  M(s)  satisfy  the  assumptions  of  Theorem  G.6.  Let  a(sQ)  be 

a  singular  value  of  M(sq)  with  multiplicity  k.  Let  y(t)  *  +  I  be  a 

curve  satisfying  the  assumptions  of  Theorem  G.6  such  that  y(tQ)  =  sq, 

t.  <  t  <  t_.  Define 
i  o  L 


ECt) 

■  diag[a1(t)  | 

•••  i  ak(t)] 

(G.  16 

-  [ux(t)  |  ••• 

:  u^oi 

(G .  17 

and 

v*> 

■o 

r— l 
< 

H* 

rt 

V-/ 

• 

• 

!  Vt)! 

(G.  18 

where  the  a„ ,  u„ 

0  '  0 

and  v  are  guaranteed  to 

exist  by  Theorem  G.6. 

Then, 

(G.19) 


o  dt  “cJ 


where 


Se[A}  -  y[A  +  AH] 


Since  the  functions  a^(t),  u^(t)  and  v^(t),  are  equal 

to  k  of  the  singular  values  and  vectors  of  M(s)  along  y(t),  it  follows  that 

the  partial  derivatives  of  these  singular  values  and  vectors  can  be  computed 

from  those  of  these  functions.  Thus,  setting  Yx(t)=*so  +  t,  t€  (-e.e)  and 

Yy(t)  ■  sq  +  jt,  t€(-e,e),  allows  the  computation  of  the  partial  derivatives 

3a  3u  3v  3a  3u  3v 

{_3^  ’  ’  ”37}  and  {17  ’  1?  '  17}  ’  respectively .  Moreover,  this 

construction  shows  that  continuous  partial  derivatives  in  one  variable  of 
all  orders  exist.  For  singular  values  of  multiplicity  one,  a  construction 
using  the  results  of  Theorem  G.5  shows  that  continuous  mixed  partial 
derivatives  of  all  orders  exist. 

Proof  of  Theorem  G.7:  Properties  of  the  singular  values  and  vectors  imply 
that ,  for  real  t , 


U  ®(t)MH(t)M(t)U  (t)  -  S2(t)  =  0 


It  follows  from  (G.21)  that  (dropping  dependence  on  t) 


(G.20) 
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dU 


H 


u  £*  +  A  H  fa 
dt  a  a  dt 


+  U 


H 


dA 


(G.23) 


U  - 


dE 


a  dt  a  dt 


2  2 

At  t=t  ,  E  (t  )  *  a  (t  )I,  .  .  Thus,  the  first  two  terms  of  (G.23) 
o  o  o  k*k 

yield: 


dU  H  0  /y  -j  dU 

U  E2  +  E^I  H  -r^- 
dt  a  a  dt 


From  the  identify 


a2(to) 


dU 


H 


a  H 

FUa+Ua  IT 


t=t 


(G.24) 


t*t 


“a"11®  S  *kxk 


it  follows  that 


dU  H  _  dU 

a  U  +  U  H  a 


dt  a  a  dt 


H  dD0 
2Se[U.H  -T^] 


a  dt 


(G.25) 


=  0 


Substituting  (G.25)  into  (G.23)  and  using  various  properties  of  singular 
values  yields : 

dE2  H  dM^M 
dt  =  a  dt  Uo 

dt  a  dt  a  a  dt  a 


(G . 26) 


n  r  H  dM  . 
■2EReIvo  dFua! 

dE  _  ,  H  dM  „  . 

3F  -Be[va 


which  was  to  be  proven. 
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It  should  be  pointed  out  that  the  derivatives  -r—  can  be  calculated 

d£ 

using  any  orthonormal  basis  for  the  right  singular  subspace  and  the  correspond 
ing  set  of  left  singular  vectors.  Thus,  at  exceptional  points  sq  m  y(tQ) 
there  is  no  need  to  determine  precisely  which  basis  of  the  right  singular 
subspace  of  M(sq)  yields  an  analytic  set  of  right  singular  vectors  along 
y(t).  This  is  discussed  in  detail  in  Freudenberg,  Loose  and  Cruz 
[10,  Theorem  2]. 

Explicit  formulas  for  the  derivatives  of  the  singular  vectors  can 

also  be  obtained.  First,  some  additional  notations  must  be  introduced. 

Let  the  number  of  distinct  singular  values  of  M(sq) G  ®n*n  be 

given  by  N.  Let  the  singular  values  be  denoted  by  o  . ,i*l,...,N,  where 

N 

the  multiplicity  of  o.  is  given  by  k,  .  (Note  I  k.  *  n.)  Then  there  are 

i*l  1 

N  right  singular  subspaces  ll^i-l, . . .  ,N  with  dimCU^-k^. 

Theorem  G.8:  Let  M(s)  satisfy  the  assumptions  of  Theorem  G.6.  Let 

y(t)  :  [t^,t2] -*•  <E  be  a  curve  satisfying  the  assumptions  of  Theorem  G.6 

such  that  y(t  )  =  s  ,  t,  <  t  <  t-.  Define 
o  o  1  o  Z 

£±<t)  -  diag[ail(t)  i  •••  !  clk  ]  (G.27) 

ui(t)  "  [uii(t)  i  •••  i  «ik  <*>]  (G.28) 

and  a  i  i 

Vi(t)  -  [vu(t)  j  •••  i  vik  (t)3  (G.29) 

i 

where  the  u^^  and  v^  are  guaranteed  to  exist  by  Theorem  G.6.  Then  the 

derivatives  of  and  can  be  written 


Proof :  Using  various  properties  of  the  singular  value  decomposition  and 
dropping  the  dependence  on  t : 


MHMUi  -  UiZi: 


dM8*! 


dU, 


dt 


u.  -TT-  +  -ri  e/ 


i  dt  dt  i 


For  m^i 


U 


H 


dA 


m  dt  i 


0H^t  2 

m  m  dt  ~ 


'm  dt  “i 


(G 


2  2 

Evaluating  (G.38)  at  t  yields  E  ■  a  I,  and  E.  ■  a, I,  ,  which  in  turn 

o  in  m  k  i  i  k, 

m  i 


implies 


u  ■»  fk .  (  l  \  „  i 

"  dt  \ct2  -  cn2/  ■ 


H 


dA 


dt  i 


(1  \  T  H  dM8  H  dM  “I 

— o - 7  )  U  3—  V.o,  +  a  V  v. 

(j2_j2y|<n  dt  ii  mmdtxj 


m 


Thus,  CG.38)  is  proven  and  (G.33)  follows  similarly.  Equations  (G.34)  and 
(G.35)  follow  from  (G.25).  Finally,  from 


v.V 


i  •*!  ■  2i 


dV 


H 


3T  Vi  +  2i“iB  7T  +  viB  S  °i 


dE  j 


dt 


(G 


At  t»tQ,  E^  *  *  ^is  fact»  and  rearranging  terms  in  (G.39)  yields 


Li  *"±1 


Li  dt  i  dtl 


-  *«u 


which  proves  Che  result.  ■ 

The  above  theorem  again  shows  Che  existence  of  a  degree  of  freedom 

H  dVf  H  ^i 

in  the  singular  vectors:  -jt-  and  can  be  chosen  as  arbitrary 

functions  analytic  in  t,  subject  to  (G.36).  Note  also  that  the  comments  which 
were  made  following  Theorem  G.7  concerning  the  partial  derivatives  of  the 
singular  values  with  respect  to  x  and  to  y  also  apply  to  the  singular  vectors. 

Recall  the  construction  of  right  singular  vectors  used  in  the  proof 
of  Theorem  G.7.  This  construction,  based  upon  the  results  of  Kato  [32,  p.  140] 
yielded  a  set  of  singular  vectors  analytic  in  t,  satisfying  a  differential 
equation.  It  is  interesting  to  investigate  the  choice  of  singular  vectors 
obtained  from  this  procedure. 

For  simplicity  assume  that  the  singular  values  are  distinct  at  sq. 

Let  y(t)  be  a  curve  through  sq  and  assume  M(s)  and  y(t)  satisfy  the  assumptions 
of  Theorems  G.6-G.8.  Define 

U(to)  £  [O^tp'l  •••  !  Un(tQ)]  (G.40) 

where  Ui(tQ)  is  a  unit  vector  in  the  right  singular  subspace  U^Ct  ).  A 
transformation  function  W(t)  can  be  constructed  so  that,  locally, 

l^Ct)  -  WCOU^)  (G.41) 

is  a  right  singular  vector  in  the  subspace  U. (t) .  Let  P. (t)  =  P  (t) 

1  ^  K 

denote  the  orthogonal  projection  onto  the  subspace  U,  (t)  and  define 


where 


Q(t)  =  Z  P'(t)P.Ct) 
k-1  k 


"  dc  [Pk<C)1 


It  can  be  shown  that  W(t)  is  the  solution  of  the  differential  equation 


£  WCt)  -  Q(t)wCt) 


with  initial  condition  W(t  )  *  I.  From  (G.41)  and  (G.44)  it  follows 

o 


du  (t) 

-~dT~  *  Q(t)ui(t) 


Hence,  supressing  dependence  on  t: 


B  dUi  H  - 

ui  IT  ’  ui  9“i 


“i  *  'kVi 

k=l 


Since  P^u^  *  u^  and  P^u^  *  0,  k^i,  CG.46)  reduces  to 

H  d“i  H  , 

"l  "dt”  U1  Pl“i 


Now,  for  real  values  of  t,  P^(.t)  *  [Pi ( t ) ]  ;  i.e.,  P^(t)  is  Hermitian. 

Moreover,  so  is  P^(t).  Thus,  u^  P^u^  is  a  real  number.  But  since 
H  ^ui 

Refu.  ”T”]  ■  0,  it  follows  that  both  sides  of  (G.47)  equal  zero.  Thus, 

1  at 

using  Kato's  procedure  yields  right  singular  vectors  with  the  property  that 
H  ^ui 

u^  0.  The  constraint  (G.36)  implies  that  the  left  singular  vectors 


given  by 
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2  n  u  H  dV1  n  u  3u< 

7  logaj  -  2  ^  I»(  (»k  -^)  Cv“  -^)  -  (u^  -ji)  -£> } 


For  kjti. 


H  3ui 


"k 

3x 

, 2  ^ 
(oi  -  v 

H 

3ui 

1 

\ 

3x 

, 2  ^ 
<0i  -  V 

Using  Lemma  9.6: 

H 

11 

3ut 

■  ■  m  ■ 

_ -J 

H 


SM* 


H  3M 


1  r  .  H  3M  H  3M^ 

1  - T~  t04  v.  —  u,  +  a,  u.  — r—  v.  } 

2  2.  i  i  3x  k  k  i  3x  k 

i  "  ak) 


3x 


2  2 
°i  “  ak 


r  H  3M  H  3M®  , 

{ai  vi  17  \  "  °k  Ui  17  V 


Thus, 


<\  ^  ^  f  f  «t 


2,  H  3M  ..  1 2  2,  H  3M  „  ,2 


,  „  „  T  r,  H  3M  w  H  3M 
+  atak2j  im[(Vi  —  Ui)]} 


-1 


,  2  2,2 

<0i  -  V 


,  2  i  H  3M  |  z  l  |  h  dn 

tai!vi  17  "it I  -\lvkl7 


2  |  H  3H  | 


Similarly,  it  can  be  shown  that 


H  ^Vi  H 

Imt  <vk  1J>  (vk  17> 1 


H  3M 
'k  3y 


• 
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Using  (H. 7)  and  (H.8)  in  (H.6)  yields 

u2i  o  r  it  H  3Vj-w  H  /  H  3ui.  .  H  9ui., 

V  log  at  «  2  Im{  (vt  — )  -  (u±  — )  (u± 


.  >  „  1  ,  .  H  3M  1 2  i  H  3M  i2, 

+  2  1  ~2 - +l\37Ui|} 


ki*i  o±  -  ak 


(H.9) 


Let  the  first  term  on  the  right-hand  side  of  (H.9)  be  denoted  (I).  Then 
from  (9.22)  and  Lemma  9.6 

(I)  -  2  ImtvJ  fi  uj  +  (uj  lm[v“  |*  uj  +  CuJ  -ji)) 

.  H  3ui.  H  3ui  , 

-  <"i  (ui  17> 1 

■  2  W-4  I.*?  f  V  utf  f  .t]  ♦  i-  f  V 


,  ,  H  dUi.  -j  T  .  H  3M  , , 


h  3ut 


h  3ui 


Since  — —  and  — —  are  both  purely  imaginary  numbers,  it  follows  that 

(I)  equals  zero.  Using  Lemma  9.6  one  more  time  in  (H.9)  yields 


„2,  „  1  r  i  H  3M  i2  .  i  H  3M  ,2 

7  108  31  *  *  — — 2  ( lvi  57  +  iyi  37  ukl 

W!  at  -  ck 


,  |  H  3M  1 2  |  H  3M  1 2, 

+  K  57"il  +  lvk1Tuil  > 


which  was  to  be  proven. 


«\  •*..  -r.  wV-f-  A y«  ■-  <.  •% 


v*’>' 


LW  f  >i: 
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APPENDIX  I 


PROOFS  OF  THEOREMS  IN  CHAPTER  12 


A  1  Cl 

Proof  of  Lemma  ,l2.1:  For  each  a€[-ir/2,ir/2] ,  define  M^ct)  ■  lim  M(ReJ  ). 

R  00 

Since  M(s)  has  proper  rational  entries,  it  follows  that  M^Ca)  exists  and  is 
independent  of  a.  Thus  define  ■  Moo(a) . 

First,  suppose  that  M^  has  rank  n.  Then  (12.7)  is  satisfied, 
for  i-1, . . .  ,n,  with  k.j-0  and  c±  -  ai[Mo8].  Suppose,  in  addition,  that 
a^lM^]  i*  aiJMoJ  ^or  T*1®11  (12.8)  is  satisfied  with  the  and  E ^ 

given  by  the  left  and  right  singular  subspaces  and  U^M^]  and  with 

the  w±  and  z±  a  pair  of  singular  vectors  v^M^]  and  u^(Mw] . 

Next,  suppose  that  has  rank  a  <  n.  Then  (12.7)  is  satisfied 
for  i-l,...,«  with  k£-0  and  c£  -  ^[Mj.  Suppose,  in  addition,  that 
a. [M  1  +  a,  [M  ]  for  i/k,  i  <  m,  k  <  m.  Then  (12.8)  is  satisfied  by  the  left 
and  right  singular  vectors  v^ [Mb]  and  u^fM^],  i*l,...,m.  Note  that  M(s)  may 
be  decomposed  as 

M(s)  ■  M  +  ^  M.(s) 

00  s  i 

where  M^s)  is  proper.  Define  P^  to  be  the  orthogonal  projection  onto  the 
subspace  9^  C  (En  spanned  by  {v^M^] , . . .  .v^tM^]  } .  Define  Pu<jo  to  be  the 
orthogonal  projection  onto  the  subspace  Uw  C  (En  spanned  by 
{u, [M  ],..., u  [M  ]}.  Similarly,  let  the  projections  onto  the  orthogonal 

i  00  m  00 

complements  of  and  be  denoted  by  P^  -  I  -  and  *  I  -  Puoo. 

By  continuity,  there  exist  m  singular  values  and  singular  subspaces 
ai[M(s)],  Di[M(s)],  and  U^MCs)]  such  that  Va€  [-it/2, tt/2]  ,  and  i«l,...,m 


a.[MJ  -  lim  a  [M(Re  )] 
R  +  ® 


V  [Mj  *  lim  U  [M(Reja)  ] 

R-*-oo  L 


U  [MJ  -  lim  U  [M(Reja)] 
R-*-® 


(Note  that  9^  ®  U  V  [Mj  and  11^  ®  U  II  [M  ] . )  Again,  these  limits  are 
i*l  i*l  m  m 

independent  of  a.  Similarly,  define  U  (s)  *  U  U  (s) ,  0  (s)  ■  U  ij  (s) , 

j.  i-1  ,  "  i-1 

and  projection  operators  PvJs)  ,  PuJs) ,  PvJs)  and  PuJs)  .  Define 


M.(s)  *  (s)M.  (s)P1  (s) 

1  V®  1  u® 


M  -  lim  M-  (ReJ  ) 
R -*■<*> 


where  the  limit  is  independent  of  a€  [— tt/2 ,tt/2]  .  Then  the  n-m  singular  values 
of  M(s)  which  approach  zero  as  |s|  -*•»  satisfy  Vct€  [-tt/2,t/2]. 


U»  Ro^CReJ0)  - 


i=l , . . . ,n-m 


To  show  this,  recall  that  by  definition 


Vi[M(s)1  =  llM<s>pu.(s>u 

B  u«  “1 


lPt«(s)M(s)Pu®(s)ul 

II  ull  *1 


which  follows  since  MCs)?1  (s)uSl/  (s)  for  u^ll^  (s) . 


°nrt-l[M(s)1 


"  .“f*  Iv(s)CM»  +  7mi(s)1pu»(s)uI2 


>  l2lPtco(s)M«piu«<s>^  --jTf  5IV-)1I 

and 

lim  Ra  [M(Rej“)]  >  o[MjJ 
R  -*• «® 

since,  by  definition,  lim  (Re^a)M  P-*"  (Re^a)  *  0  for  all  aS  [-ir/2,ir/2] . 

^f<X>  00  jJflO 

Moreover ,  from 

Vx[M<3)1  i  ;tpt-(3)M-pu-(s)1  +]7[5[]Si(s)) 

it  follows  that 


lim  Ra  [M(s)]  =  o^J 
R 


Similarly,  it  may  be  shown  that 


lim  Ra 
R  -*■« 


m+i 


[M(S>] 


i*l )  i  •  •  jU^ni 


Let  have  rank  (note  <  n-m) .  Then  (12.7)  is  satisfied  by  setting 

km+i  “  1  and  Cm+i  “  ai^l»]  for  i=1-'--*mr 

Suppose  that  the  nonzero  singular  values  of  are  distinct.  Then 

(12.8)  is  satisfied  by  left  and  right  singular  vectors  v^  =  v  [ft^]  and 
ui  “  ui^l«^  for  •  •  •  »m]_*  That  is 

*  lim  Re^av^  M(Re^a)u^ 


If  rank  ■  n-m,  then  the  above  procedure  gives  the  values  of 

and  in  (12.7)  and  the  z^  and  w^  in  (12.8)  for  all  i*l,...,n.  If 
<  n-m,  then  there  exist  singular  values  for  which  (12.7)  is  satisfied 
for  k^  >  2.  The  above  procedure  can  be  repeated  by  writing 


M, 


«  1 
M,  +  —  M_ 
1“  s  2 


where  M2  is  proper.  The  singular  value  decomposition  of  may  be  used  to 
obtain  projection  operators  onto  the  nullspace  of  and  its  image.  These 
in  turn  may  be  used  to  define  M2(s)  similarly  to  M^s).  Since,  (a)  M(s)  is 
rational,  (b)  det  M(s)  t  0,  and  (c)  only  n  singular  values  exist,  this 
procedure  may  be  repeated  until  values  of  k^,  c^,  w^,  and  z^  are  found  for 
all  i=l, . . . ,n.  ■ 

Before  proceeding  with  the  proof  of  Theorem  12.2  it  is  necessary  to 
present  some  additional  Definitions  and  Lemmas . 

Definition  1.1:  Let  g(s)  ■  g(x,y)  and  h(s)  *  h(x,y)  be  real  valued  functions 
of  the  complex  frequency  variable  s-x+jy.  Then  f(s)  =  g(s)  +  jh(s)  is  said 


Equivalently, 


f  (s) 


(I.lc) 


Lemma  1.2:  Assume  that  f(s)  -  g(s)  +  jh(s)  is  conjugate  symmetric,  and  that 
g(x,y)  and  h(x,y)  have  continuous  partial  derivatives.  Then 


la 

.  la 

9x 

9x 

(x,y) 

Cx,-y) 

la 

..la 

3y 

9y 

(x,y) 

(x,-y) 

9h 

9h 

9x 

3x 

(x,y) 

(x,-y) 

9h 

,  Jh 

1 

9y 

9y 

(x,y) 

(x,-y) 

(I. 2a) 


(I. 2b) 


(I. 2c) 


(I. 2d) 


Proof :  By  definition. 


where  Ay'  *  Ay.  Proof  of  Equations  (I. 2c)  and  (I. 2d)  follows  similarly. 

■ 

Lemma  1.3:  Let  M(s)  satisfy  the  assumptions  of  Theorem  12.1.  Then  each 
singular  value  and  its  associated  measure  of  phase  difference  (12.3)  have 
the  property  that  f(s)  =  logo(s)  +  j[9(s)-0(O)]  is  conjugate  symmetric. 

■ 

Proof:  Let  the  singular  value  decomposition  of  M(s)  =  M(x,y)  be  given  by 

M(x,y)  -  V(x,y)I(x,y)ru(x,y)]H  .  (1.3) 

Then  each  singular  value  may  be  written  (suppressing  the  subscript)  as 

a(x,y)  =  [v(x,y) ]HM(x,y)u(x,y) .  The  left  and  right  singular  vectors  v  and 
u  may  be  written  as  in  (12.2),  yielding 

a(x,y)  exp[jA9(x,y) ]  =  [pv(x,y) ]HM(x,y)pu(x,y)  .  (1.4) 

Since  rational  functions  are  conjugate  symmetric,  it  follows  that 

M(x,y)  *  M(x,-y).  By  assumption  p^  and  p^  are  also  conjugate  symmetric. 

Thus  (1.4)  implies  that  o(x,y)  =  o(x,-y)  and  exp[j9(x,y)]  *  exp [-j 9 (x,-y) ] . 

Hence  9(x,y)  *  -9(x,-y)  +  2k:r  and  9(x,0)  =  kit,  with  k  even  if 

exp[j6(x,0)]  *  1  and  odd  if  exp[j9(x,0)]  *  -  1.  Finally,  the  assumptions  of 

Theorem  12.2  yield  that  9(x,0)  is  constant  in  x.  Denoting  this  constant  by 

9(0)  yields  the  result.  ■ 

2 

Next,  if  C  is  identified  with  K  ,  a  basis  for  the  cotangent  space 
to  a  point  s»x+jy€C  is  given  by  {dx,dy}.  A  complex  basis  for  the  cotangent 
space  is  given  by 


{ds  ■  dx  +  jdy,  ds  ■  dx  -jdy) 


The  dual  basis  for  the  tangent  space  is  given  by  [62,  p.  2] 


,J_  A  1  ,J_ 
l3s  2v3x 


ia(i  +  j  j.)} 

2'3x  3 


3s 


3y 


(1.5) 


Let  g(s)  *  g(x,y)  and  h(s)  -  h(x,y)  be  C  in  (x,y) .  Then  f(s)  -  g(s)  +  jh(s) 

3f 

is  analytic  iff  —  »  0,  where 
3s 


(1.6) 


3f 

Thus  the  condition  »  0  is  equivalent  to  the  Cauchy-Riemann 

equations  being  satisfied.  Note  also  that  Lemmas  1.2  and  1.3  imply 
3f 

—  is  conjugate  symmetric.  Finally,  note  that 


dsAds  ■  -  2j  dxAdy 


(1.7) 


It  will  be  convenient  to  have  an  expression  for  the  basis  (1.5)  in  terms  of 
polar  coordinates  on  the  closed  right  half  plane.  Let  ja  be  a  point  on  the 
joj-axis.  Then  a  polar  coordinate  system  is  given  by 


s  »  ja  +  re^a 


r€  [0,») 

ct€  [ -tt/2, tt/2] 


(1.8) 


ds  •  e^a(dr  +  jr  da) 


Thus, 


(1. 9s 


3s  2  ''ar  r  aa' 

_L.IeJ«(J_  +  i_L) 

31  2  v3r  r  Set' 


(I. 10a) 


(I. 10b) 


dsAds  «*  -  2  jr  dr  Ada 


(I.  ID 


In  polar  coordinates,  (1.6)  is  given  by 


Ieja(il+  111) 

2  ^3r  r  3a; 


IeJa[(i&+  j  111)  +i  (J*+  <  ill)] 

2  u3r  J  3r'  r  '•aa  J  3a' J 

—  e^a  r  (la.  _  i.  Hi)  +  4  (Hi  +  i.  la.)  i 

2  U3r  r  3a'  3*3r  r  3a'  J 


(1.12) 


Proof  of  Theorem  12.2:  Consider  the  closed  curve,  C(R,e) ,  traversed  counter¬ 
clockwise,  pictured  in  Figure  1.1.  Denote  the  interior  of  C(R,e)  by  D(R,e), 
the  large  semicircle  by  C^,  the  small  semicircle  by  C£,  and  let  and  C£  be 
as  pictured  in  Figure  1.1.  Let  D (R,e)  have  the  orientation  given  by  the 
standard  basis  for  the  tangent  3pace  to  points  in  IR  .  Then  Stokes'  Theorem 
[49,  p.  124]  shows  that,  letting  f ( s )  *  logo(s)  +  j9(s). 


/  ds  *  -  /  |l ■  ( — — )dsAdI 

C(R,e)  S-ju  D(R, e)  3s 


(1.13) 


The  proof  will  be  broken  into  several  steps. 

Step  1:  Take  the  limit  as  e  ■+•  0  in  (1.13).  First,  consider  the  curve  Cg 
parameterized  by  s(t)  ■  jw  +  ee^*^  TTt^ ,  0  <  t  <  1.  Then 

ds  ”  ggj  Tt)  /_4  t'S  A  l-  -.~-4  4 


(-jt)dt  and  continuity  implies 


lim  /  — r^-  ds 

e-H)  C  8“3“ 
e 


-  j  IT  /  f  (jcj  +  ce^  ^ir'2_Trt^)dt 

0 

-  f(juj) 


(1.14 


Thus , 


/  ISll.  ds  *  lim  /  H§2.  d s 

C(R,0)  S“jaJ  e-K)  C(R,e)  S-J“ 

-  -  i •  *0.)  +  / 

CR 

urt-e  .  -R  . 

+  lim  [/  f(jy)  ^  +  /  f(jy)  T^J 

e-K)  R  y  w-e  y 


(1.15 


-K 

Define  /  f(jy)  —^7  by  the  limit  in  (1.15),  provided  this  limit  exists  and  is 
R 

finite.  The  fact  that  the  limit  indeed  exists  and  is  finite  will  be  proven 

by  showing  that  lim  of  the  right  hand  side  of  (1.13)  exists  and  is  finite. 
e-K) 

First,  define  a  polar  coordinate  system  on  the  closed  right  half 

plane  by 


s  -  x+jy 


«  jw  +  reJ 


re  [0,») 

ae  [ -it/2, fl/2] 


Since  the  integrand  of  (1.16)  is  independent  of  e,  it  follows  that 


/  (e“Jo)drAda  -  lio  /  ~  (e-ja)drAda 

D(R,0)  dS  e->0  D(R,e)  38 

exists  and  is  finite.  Thus  the  limit  in  Cl. 15)  has  these  properties 
Rearranging  yields 


CR  R 


+  -r~  /  |=-  ( — — )dsAds 

D(R,0)  98  S_:,a> 


f(jy) 


_di_ 

y-w 


Similarly,  it  may  be  shown  that 


-R 


-  F  1  f(s)(^fe)ds  *JW  l 

Cr  R 


+  TT  /  |f  )dsAds 

j7r  icR.O)  3s  s+^ 


Thus  subtracting  (1.18)  from  (1.17)  yields 


2j[e(M  -6(0)]  -  i/  f(.)  t-l-.-i-H, 


'R 

-R 


+  F  l  f<Jy>  -  7fcldy 


+ £  i(i,0)  #  (^,d8M5  -  £  '  «  (^-,dsA' 


D(R,0) 


3f  (  1 
3s  's+jcu'' 


-  ,7 1  ^  -  7^dy 


-  T~  j  ||  (-r—)dsAds  +  |r  /  |4  (t~ -)dsAdi 

2,1  D(R,0)  38  S-^  2*  D(R,0)  38  S+Jw 


Step  2 :  Show  that  conjugate  symmetry  implies 

-  /  |i  (— ^-)dsAdi  +  ~  (-^r-)dsAds 

271  D(R,0)  38  8“J“  211  D(R,0)  38  S+jw 


(1.19) 


-  -  Ref/  ||  (— ~)dsAds] 

"  D(R,0)  38  8-ju 


(1.20) 


Define  the  notation 


L  *  /  |=-  ( — ^r-)dsAds 

D(R.O)  38  a“*a 


(I. 21a) 


R  -  /  (— ~)dsAds 

D(R,0)  38  S+ju) 


(I. 21b) 


and  a  polar  coordinate  system 


s  ■  jw  +  re' 


r€  [0,») 


a€  [-tt/2, tt/2] 


Then  dsAds  *  -  2jr  dr  Ada  and  — ^ a 


e  J  .  Thus 


397 


Assume  (with  no  loss  of  generality)  that  R>o>.  Then  CR  implicitly  defii.es  r 
as  a  function  of  o  via  the  equation 

2,2  _2 

x  +  y  -  R 

**  (r  cos  a)^  +  (u  +  r  sin  a)^  -  R^ 

=*  +  r(2oj  sin  a)  +  -  R^  ■  0 

fy  2  2 

*►  r  »  -  to  sina+  vR  -  w  cos  a 


Thus, 


tt/2  F(ct) 

L  -  -  2j  /  /  !=■  e“J  drAda 

-tt/2  0  38 


Similarly,  define  a  coordinate  system  by  s*-jw  +  peJ  ,  p€  [0,®), 
SS  [— tt/ 2 , tt/2 ]  and  define  G(8)  -  u  sin  8  +  A?  -  oo^  cos^  0. 

It  follows  that 


tt/2  G(S)  ,, 

R  -  -  2j  /  /  ||  dpdS 

-it/2  0  33 


Conjugate  symmetry  implies  that 


(f§)  “  (ff ) 

s-pe^8  s.pe-j3 


Then 


(1.23) 


(1.24) 


(1.25) 


tt/2  G(3)  tj-tt - 

R--2J/  /  ff«JS 

-”/2  0  35 _ -it 


Define  a  new  variable  of  integration  y  -  -  8.  Then  (1.26)  reduces  to 


(1.26) 


>  *.  •.  \  %  .  % 


398 


-  .  -  WTJKT ICtJtTK.'TTtJI  KTIt-l  H71 


P.'- 

*V.“ 

V 

K 

i 


k- 


-j/2  G(-y) 


-2  3 !  ! 

■nil  0 


ii 

3s 


e“^Y  dp(-dy) 


s»pe 


iy 


Noting 


G(— y )  ■  a)  sin(-Y)  +  /R^  -  u?  cos^Y 


f(y) 


it  follows  that 


*/2  FCY) 


"  /  *  VI  /  . 

*■-«/.  /  H«'jY 


-it/ 2  0 


dpdY 


(1.27) 


Equations  (1.24)  and  (1.27)  show  that  L  *  -  R  (since  a  and  y  are  merely 
dummy  variables  of  integration).  Thus  the  left  hand  side  of  (1.20)  reduces  to 

*  57  L  +  57  *  -  -  5T  [L  -  « 

-  -  h [L  +  E> 

-  -  i  RetL] 

which,  using  (I.21a),  proves  Cl. 20). 

From  (1.12)  and  (1.22) 

L"2>  DO R.0)  *  eJ°  •  r  +  3<f  +  7  ^Ada 


f.'. 


Using  (1.28)  in  (1.19)  yields 


where  R>co  is  assumed  with  no  loss  of  generality.  From  Lemma  12.1  it  follows 
that  as  R-*- 00  log  a  approaches  infinity  no  faster  than  log  R  and  0  remains 


bounded.  Thus  M(R)  -*•  0  as  R  +  ®.  A  standard  argument  may  then  be  used  to  show 
that  A=>  0. 

Next,  consider 

B  ’  ^  l  f(M  ~^~2  dy 

R-*-“  R  y  -to 

R  . 

-  lim  j  f  f(jy)  — 2 — j  d ? 

R -*•<*>  -R  y  -a) 

An  argument  similar  to  the  one  used  to  show  A*  0  may  be  used  to  show 

00 

B  -  7  /  f  <jy)  -r~z 

-ao  y  -oj 


Conjugate  symmetry  implies 


B  -  ~  /  log  o  *(“ j)  dy 

y  -id 


0 


Define 


l_  + i  /  0  <i  - +  i  “a****** 

R  -*■<*  D(R,0) 


CRHP 


where  the  existence  and  finiteness  of  the  limit  follows  from  (1.29)  and  the 
fact  that  A  and  B  exist  and  are  finite.  Thus,  (1.29)  reduces  to 


0(jw)  -  6(0) 


2  op 
TT 


/  log  g-Cy1— )dy 
0  u  -y 


I  r  /M  ,  i  aiogoXJ_,J. 


An  argument  identical  to  one  used  by  Bode  may  be  applied  to  the  first  integral 
in  (1.30)  to  show 


•U«>  -  «o>  -  i  /"  {log  coth  i^-}dv 


-i/  (f  +  i  )  drAdo 

^  /.«rm  3 ^  r  3a 


CRHP 


where 


v  *  log  (y/oj) 


The  fact  that 


j 


39  .  1  31ogg 
3r  r  3a 


H 


H 


u 


yields  the  result.  (Note  in  the  statement  of  the  theorem  that  cj  is  replaced 
by  u  and  y  is  replaced  by  oo.) 
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